Dataset info
| Number of variables | 181 |
|---|---|
| Number of observations | 2000 |
| Missing cells | 240887 (66.5%) |
| Duplicate rows | 0 (0.0%) |
| Total size in memory | 2.7 MiB |
| Average record size in memory | 1.4 KiB |
Variables types
| Numeric | 47 |
|---|---|
| Categorical | 26 |
| Boolean | 16 |
| Date | 0 |
| URL | 0 |
| Text (Unique) | 1 |
| Rejected | 91 |
| Unsupported | 0 |
Warnings
coligada_mais_antiga_ativa has 1737 (86.9%) missing values | Missing |
coligada_mais_antiga_baixada has constant value "nan" | Rejected |
coligada_mais_nova_ativa has 1737 (86.9%) missing values | Missing |
coligada_mais_nova_baixada has constant value "nan" | Rejected |
de_faixa_faturamento_estimado has 118 (5.9%) missing values | Missing |
de_faixa_faturamento_estimado_grupo has 118 (5.9%) missing values | Missing |
de_indicador_telefone has constant value "BOA" | Rejected |
de_nivel_atividade has 51 (2.5%) missing values | Missing |
de_saude_rescencia has 64 (3.2%) missing values | Missing |
de_saude_tributaria has 64 (3.2%) missing values | Missing |
dt_situacao only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
dt_situacao has a high cardinality: 1315 distinct values | Warning |
empsetorcensitariofaixarendapopulacao has 600 (30.0%) missing values | Missing |
faturamento_est_coligados has 1738 (86.9%) missing values | Missing |
faturamento_est_coligados_gp is highly correlated with faturamento_est_coligados (ρ = 0.95095) | Rejected |
fl_epp has constant value "False" | Rejected |
fl_optante_simei has 346 (17.3%) missing values | Missing |
fl_optante_simples has 346 (17.3%) missing values | Missing |
fl_spa has constant value "False" | Rejected |
fl_st_especial has constant value "False" | Rejected |
grau_instrucao_macro_analfabeto has 1994 (99.7%) missing values | Missing |
grau_instrucao_macro_desconhecido has constant value "nan" | Rejected |
grau_instrucao_macro_escolaridade_fundamental is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.99804) | Rejected |
grau_instrucao_macro_escolaridade_media has 1694 (84.7%) missing values | Missing |
grau_instrucao_macro_escolaridade_superior is highly correlated with grau_instrucao_macro_escolaridade_media (ρ = 0.91788) | Rejected |
idade_acima_de_58 is highly correlated with grau_instrucao_macro_analfabeto (ρ = 1) | Rejected |
idade_ate_18 has 1990 (99.5%) missing values | Missing |
idade_de_19_a_23 has 1879 (94.0%) missing values | Missing |
idade_de_24_a_28 is highly correlated with idade_de_19_a_23 (ρ = 0.90867) | Rejected |
idade_de_29_a_33 is highly correlated with grau_instrucao_macro_escolaridade_superior (ρ = 0.97492) | Rejected |
idade_de_34_a_38 is highly correlated with idade_de_29_a_33 (ρ = 0.99405) | Rejected |
idade_de_39_a_43 is highly correlated with idade_de_34_a_38 (ρ = 0.98349) | Rejected |
idade_de_44_a_48 is highly correlated with idade_de_39_a_43 (ρ = 0.94917) | Rejected |
idade_de_49_a_53 is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.98916) | Rejected |
idade_de_54_a_58 is highly correlated with idade_de_49_a_53 (ρ = 0.96175) | Rejected |
idade_maxima_coligadas is highly correlated with coligada_mais_antiga_ativa (ρ = 0.99976) | Rejected |
idade_maxima_socios has 659 (33.0%) missing values | Missing |
idade_media_coligadas has 1737 (86.9%) missing values | Missing |
idade_media_coligadas_ativas is highly correlated with idade_media_coligadas (ρ = 0.99964) | Rejected |
idade_media_coligadas_baixadas has constant value "nan" | Rejected |
idade_media_socios is highly correlated with idade_maxima_socios (ρ = 0.95544) | Rejected |
idade_minima_coligadas is highly correlated with coligada_mais_nova_ativa (ρ = 1) | Rejected |
idade_minima_socios is highly correlated with idade_media_socios (ρ = 0.95058) | Rejected |
max_faturamento_est_coligados is highly correlated with faturamento_est_coligados_gp (ρ = 0.93599) | Rejected |
max_faturamento_est_coligados_gp is highly correlated with max_faturamento_est_coligados (ρ = 0.95994) | Rejected |
max_filiais_coligados has 1919 (96.0%) missing values | Missing |
max_funcionarios_coligados_gp has 28 (1.4%) zeros | Zeros |
max_funcionarios_coligados_gp has 1830 (91.5%) missing values | Missing |
max_meses_servicos is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.96649) | Rejected |
max_meses_servicos_all is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.96649) | Rejected |
max_vl_folha_coligados has 1858 (92.9%) missing values | Missing |
max_vl_folha_coligados_gp is highly correlated with max_funcionarios_coligados_gp (ρ = 0.90484) | Rejected |
media_faturamento_est_coligados has 1738 (86.9%) missing values | Missing |
media_faturamento_est_coligados_gp is highly correlated with media_faturamento_est_coligados (ρ = 0.95038) | Rejected |
media_filiais_coligados has 1919 (96.0%) missing values | Missing |
media_funcionarios_coligados_gp has 28 (1.4%) zeros | Zeros |
media_funcionarios_coligados_gp has 1830 (91.5%) missing values | Missing |
media_meses_servicos is highly correlated with max_meses_servicos (ρ = 0.98685) | Rejected |
media_meses_servicos_all is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.97656) | Rejected |
media_vl_folha_coligados has 1858 (92.9%) missing values | Missing |
media_vl_folha_coligados_gp is highly correlated with media_funcionarios_coligados_gp (ρ = 0.90412) | Rejected |
meses_ultima_contratacaco has 1533 (76.6%) missing values | Missing |
min_faturamento_est_coligados has 1738 (86.9%) missing values | Missing |
min_faturamento_est_coligados_gp has 1738 (86.9%) missing values | Missing |
min_filiais_coligados is highly correlated with min_faturamento_est_coligados_gp (ρ = 0.94222) | Rejected |
min_funcionarios_coligados_gp is highly correlated with min_filiais_coligados (ρ = 0.90511) | Rejected |
min_meses_servicos is highly correlated with meses_ultima_contratacaco (ρ = 0.9357) | Rejected |
min_meses_servicos_all is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.98306) | Rejected |
min_vl_folha_coligados has 1858 (92.9%) missing values | Missing |
min_vl_folha_coligados_gp is highly correlated with min_vl_folha_coligados (ρ = 0.98176) | Rejected |
nm_divisao has a high cardinality: 72 distinct values | Warning |
nm_meso_regiao has 278 (13.9%) missing values | Missing |
nm_micro_regiao has a high cardinality: 73 distinct values | Warning |
nm_micro_regiao has 278 (13.9%) missing values | Missing |
nu_meses_rescencia has 199 (10.0%) missing values | Missing |
percent_func_genero_fem is highly correlated with grau_instrucao_macro_analfabeto (ρ = 0.94572) | Rejected |
percent_func_genero_masc has 83 (4.2%) zeros | Zeros |
percent_func_genero_masc has 1659 (83.0%) missing values | Missing |
qt_admitidos has 1533 (76.6%) missing values | Missing |
qt_admitidos_12meses has 342 (17.1%) zeros | Zeros |
qt_admitidos_12meses has 1533 (76.6%) missing values | Missing |
qt_alteracao_socio_180d has constant value "nan" | Rejected |
qt_alteracao_socio_365d has constant value "nan" | Rejected |
qt_alteracao_socio_90d has constant value "nan" | Rejected |
qt_alteracao_socio_total has constant value "nan" | Rejected |
qt_art has 1974 (98.7%) missing values | Missing |
qt_coligadas has 1791 (89.5%) missing values | Missing |
qt_coligados is highly correlated with qt_coligadas (ρ = 0.99508) | Rejected |
qt_coligados_agropecuaria has 1737 (86.9%) missing values | Missing |
qt_coligados_atividade_alto has constant value "0.0" | Rejected |
qt_coligados_atividade_baixo has constant value "0.0" | Rejected |
qt_coligados_atividade_inativo has constant value "0.0" | Rejected |
qt_coligados_atividade_medio has constant value "0.0" | Rejected |
qt_coligados_atividade_mt_baixo has constant value "0.0" | Rejected |
qt_coligados_ativo is highly correlated with qt_coligados (ρ = 0.99225) | Rejected |
qt_coligados_baixada is highly correlated with min_vl_folha_coligados_gp (ρ = 0.97761) | Rejected |
qt_coligados_ccivil has 212 (10.6%) zeros | Zeros |
qt_coligados_ccivil has 1737 (86.9%) missing values | Missing |
qt_coligados_centro has 255 (12.8%) zeros | Zeros |
qt_coligados_centro has 1737 (86.9%) missing values | Missing |
qt_coligados_comercio has 144 (7.2%) zeros | Zeros |
qt_coligados_comercio has 1737 (86.9%) missing values | Missing |
qt_coligados_epp has constant value "0.0" | Rejected |
qt_coligados_exterior has 1737 (86.9%) missing values | Missing |
qt_coligados_inapta has 1737 (86.9%) missing values | Missing |
qt_coligados_industria has 229 (11.5%) zeros | Zeros |
qt_coligados_industria has 1737 (86.9%) missing values | Missing |
qt_coligados_ltda has 1737 (86.9%) missing values | Missing |
qt_coligados_matriz is highly correlated with qt_coligados_ativo (ρ = 0.99409) | Rejected |
qt_coligados_me has 1737 (86.9%) missing values | Missing |
qt_coligados_mei has 1737 (86.9%) missing values | Missing |
qt_coligados_nordeste has 93 (4.7%) zeros | Zeros |
qt_coligados_nordeste has 1737 (86.9%) missing values | Missing |
qt_coligados_norte is highly correlated with idade_ate_18 (ρ = 0.90019) | Rejected |
qt_coligados_nula has constant value "0.0" | Rejected |
qt_coligados_sa is highly correlated with qt_coligados_matriz (ρ = 0.9177) | Rejected |
qt_coligados_serviço has 96 (4.8%) zeros | Zeros |
qt_coligados_serviço has 1737 (86.9%) missing values | Missing |
qt_coligados_sudeste is highly correlated with qt_coligados_sa (ρ = 0.91504) | Rejected |
qt_coligados_sul has 258 (12.9%) zeros | Zeros |
qt_coligados_sul has 1737 (86.9%) missing values | Missing |
qt_coligados_suspensa has constant value "0.0" | Rejected |
qt_desligados has 54 (2.7%) zeros | Zeros |
qt_desligados has 1533 (76.6%) missing values | Missing |
qt_desligados_12meses has 332 (16.6%) zeros | Zeros |
qt_desligados_12meses has 1533 (76.6%) missing values | Missing |
qt_ex_funcionarios is highly correlated with qt_desligados (ρ = 1) | Rejected |
qt_filiais is highly skewed (γ1 = 27.833) | Skewed |
qt_filiais has 1818 (90.9%) zeros | Zeros |
qt_funcionarios is highly correlated with idade_de_44_a_48 (ρ = 0.95213) | Rejected |
qt_funcionarios_12meses is highly correlated with qt_funcionarios (ρ = 0.987) | Rejected |
qt_funcionarios_24meses is highly correlated with qt_funcionarios_12meses (ρ = 0.98125) | Rejected |
qt_funcionarios_coligados is highly correlated with qt_coligados_sudeste (ρ = 0.90732) | Rejected |
qt_funcionarios_coligados_gp is highly correlated with max_funcionarios_coligados_gp (ρ = 0.97393) | Rejected |
qt_funcionarios_grupo is highly correlated with qt_filiais (ρ = 0.90038) | Rejected |
qt_ramos_coligados is highly correlated with qt_coligadas (ρ = 0.94424) | Rejected |
qt_regioes_coligados has 1737 (86.9%) missing values | Missing |
qt_socios has 502 (25.1%) missing values | Missing |
qt_socios_coligados is highly correlated with qt_funcionarios_coligados (ρ = 0.91132) | Rejected |
qt_socios_feminino has 1377 (68.8%) missing values | Missing |
qt_socios_masculino has 1139 (57.0%) missing values | Missing |
qt_socios_pep is highly correlated with qt_socios_masculino (ρ = 0.98678) | Rejected |
qt_socios_pf is highly correlated with qt_socios_pep (ρ = 0.93094) | Rejected |
qt_socios_pj has 502 (25.1%) missing values | Missing |
qt_socios_pj_ativos is highly correlated with qt_socios_pj (ρ = 1) | Rejected |
qt_socios_pj_baixados has constant value "0.0" | Rejected |
qt_socios_pj_inaptos has constant value "0.0" | Rejected |
qt_socios_pj_nulos has constant value "0.0" | Rejected |
qt_socios_pj_suspensos has constant value "0.0" | Rejected |
qt_socios_st_regular is highly correlated with qt_socios_pf (ρ = 0.96491) | Rejected |
qt_socios_st_suspensa has 1986 (99.3%) missing values | Missing |
qt_ufs_coligados has 1737 (86.9%) missing values | Missing |
sum_faturamento_estimado_coligadas is highly correlated with media_faturamento_est_coligados_gp (ρ = 0.99843) | Rejected |
total is highly correlated with qt_funcionarios_24meses (ρ = 0.99141) | Rejected |
total_filiais_coligados is highly correlated with qt_socios_coligados (ρ = 0.96516) | Rejected |
tx_crescimento_12meses has 210 (10.5%) zeros | Zeros |
tx_crescimento_12meses has 1658 (82.9%) missing values | Missing |
tx_crescimento_24meses has 131 (6.6%) zeros | Zeros |
tx_crescimento_24meses has 1646 (82.3%) missing values | Missing |
tx_rotatividade has 362 (18.1%) zeros | Zeros |
tx_rotatividade has 1533 (76.6%) missing values | Missing |
vl_faturamento_estimado_aux is highly correlated with total (ρ = 0.98498) | Rejected |
vl_faturamento_estimado_grupo_aux is highly correlated with qt_socios_st_suspensa (ρ = 0.94871) | Rejected |
vl_folha_coligados is highly correlated with qt_socios_pep (ρ = 0.94709) | Rejected |
vl_folha_coligados_gp is highly correlated with vl_folha_coligados (ρ = 0.92398) | Rejected |
vl_frota has 1885 (94.2%) missing values | Missing |
vl_idade_maxima_socios_pj has 1981 (99.1%) missing values | Missing |
vl_idade_media_socios_pj is highly correlated with vl_idade_maxima_socios_pj (ρ = 0.98045) | Rejected |
vl_idade_minima_socios_pj is highly correlated with vl_idade_media_socios_pj (ρ = 0.98102) | Rejected |
vl_potenc_cons_oleo_gas is highly correlated with vl_folha_coligados_gp (ρ = 0.99995) | Rejected |
vl_total_tancagem has constant value "nan" | Rejected |
vl_total_tancagem_grupo is highly correlated with vl_folha_coligados_gp (ρ = 1) | Rejected |
vl_total_veiculos_antt has constant value "nan" | Rejected |
vl_total_veiculos_antt_grupo has constant value "nan" | Rejected |
vl_total_veiculos_leves has 32 (1.6%) zeros | Zeros |
vl_total_veiculos_leves has 1864 (93.2%) missing values | Missing |
vl_total_veiculos_leves_grupo is highly skewed (γ1 = 44.237) | Skewed |
vl_total_veiculos_leves_grupo has 1834 (91.7%) zeros | Zeros |
vl_total_veiculos_pesados is highly correlated with vl_potenc_cons_oleo_gas (ρ = 0.91186) | Rejected |
vl_total_veiculos_pesados_grupo is highly correlated with vl_potenc_cons_oleo_gas (ρ = 0.90934) | Rejected |
coligada_mais_antiga_ativa
Numeric
| Distinct count | 261 |
|---|---|
| Unique (%) | 13.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 219.55 |
|---|---|
| Minimum | 1.7667 |
| Maximum | 636.9 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.7667 |
|---|---|
| 5-th percentile | 39.197 |
| Q1 | 117.15 |
| Median | 193.97 |
| Q3 | 280.18 |
| 95-th percentile | 538.3 |
| Maximum | 636.9 |
| Range | 635.13 |
| Interquartile range | 163.03 |
Descriptive statistics
| Standard deviation | 145.12 |
|---|---|
| Coef of variation | 0.661 |
| Kurtosis | 0.94297 |
| Mean | 219.55 |
| MAD | 111.28 |
| Skewness | 1.0926 |
| Sum | 57741 |
| Variance | 21060 |
| Memory size | 31.2 KiB |
| Value | Count | Frequency (%) | |
| 247.5 | 2 | 0.1% | |
| 231.13 | 2 | 0.1% | |
| 243.7 | 2 | 0.1% | |
| 262.6 | 1 | 0.1% | |
| 71.567 | 1 | 0.1% | |
| 538.47 | 1 | 0.1% | |
| 113.07 | 1 | 0.1% | |
| 294.77 | 1 | 0.1% | |
| 163.27 | 1 | 0.1% | |
| 358.97 | 1 | 0.1% | |
| Other values (250) | 250 | 12.5% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.7667 | 1 | 0.1% | |
| 5.2667 | 1 | 0.1% | |
| 11.267 | 1 | 0.1% | |
| 16.933 | 1 | 0.1% | |
| 23.767 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 636.9 | 1 | 0.1% | |
| 636.6 | 1 | 0.1% | |
| 636.03 | 1 | 0.1% | |
| 635.8 | 1 | 0.1% | |
| 635.07 | 1 | 0.1% |
coligada_mais_antiga_baixada
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
coligada_mais_nova_ativa
Numeric
| Distinct count | 253 |
|---|---|
| Unique (%) | 12.7% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 116.72 |
|---|---|
| Minimum | 1.5333 |
| Maximum | 634.4 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.5333 |
|---|---|
| 5-th percentile | 4.8767 |
| Q1 | 42.3 |
| Median | 87.8 |
| Q3 | 176.35 |
| 95-th percentile | 307.98 |
| Maximum | 634.4 |
| Range | 632.87 |
| Interquartile range | 134.05 |
Descriptive statistics
| Standard deviation | 99.6 |
|---|---|
| Coef of variation | 0.85335 |
| Kurtosis | 2.3595 |
| Mean | 116.72 |
| MAD | 79.01 |
| Skewness | 1.3048 |
| Sum | 30696 |
| Variance | 9920.2 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 43.7 | 3 | 0.1% | |
| 86.067 | 2 | 0.1% | |
| 50.8 | 2 | 0.1% | |
| 37.933 | 2 | 0.1% | |
| 37.3 | 2 | 0.1% | |
| 2.7 | 2 | 0.1% | |
| 96.267 | 2 | 0.1% | |
| 243.7 | 2 | 0.1% | |
| 4.8333 | 2 | 0.1% | |
| 3.4 | 2 | 0.1% | |
| Other values (242) | 242 | 12.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.5333 | 1 | 0.1% | |
| 1.7667 | 1 | 0.1% | |
| 2.4667 | 1 | 0.1% | |
| 2.5 | 1 | 0.1% | |
| 2.7 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 634.4 | 1 | 0.1% | |
| 407.73 | 1 | 0.1% | |
| 389.97 | 1 | 0.1% | |
| 369.33 | 1 | 0.1% | |
| 365.57 | 1 | 0.1% |
coligada_mais_nova_baixada
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
de_faixa_faturamento_estimado
Categorical
| Distinct count | 10 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 5.9% |
| Missing (n) | 118 |
| DE R$ 81.000,01 A R$ 360.000,00 | |
|---|---|
| ATE R$ 81.000,00 | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 200 |
| Other values (6) | 73 |
| (Missing) | 118 |
| Value | Count | Frequency (%) | |
| DE R$ 81.000,01 A R$ 360.000,00 | 1166 | 58.3% | |
| ATE R$ 81.000,00 | 443 | 22.1% | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 200 | 10.0% | |
| DE R$ 1.500.000,01 A R$ 4.800.000,00 | 51 | 2.5% | |
| DE R$ 4.800.000,01 A R$ 10.000.000,00 | 6 | 0.3% | |
| SEM INFORMACAO | 6 | 0.3% | |
| DE R$ 10.000.000,01 A R$ 30.000.000,00 | 6 | 0.3% | |
| DE R$ 30.000.000,01 A R$ 100.000.000,00 | 3 | 0.1% | |
| DE R$ 300.000.000,01 A R$ 500.000.000,00 | 1 | 0.1% | |
| (Missing) | 118 | 5.9% |
| Max length | 40 |
|---|---|
| Mean length | 26.457 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_faixa_faturamento_estimado_grupo
Categorical
| Distinct count | 12 |
|---|---|
| Unique (%) | 0.6% |
| Missing (%) | 5.9% |
| Missing (n) | 118 |
| DE R$ 81.000,01 A R$ 360.000,00 | |
|---|---|
| ATE R$ 81.000,00 | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | |
| Other values (8) | 116 |
| (Missing) | 118 |
| Value | Count | Frequency (%) | |
| DE R$ 81.000,01 A R$ 360.000,00 | 1094 | 54.7% | |
| ATE R$ 81.000,00 | 438 | 21.9% | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 234 | 11.7% | |
| DE R$ 1.500.000,01 A R$ 4.800.000,00 | 51 | 2.5% | |
| DE R$ 4.800.000,01 A R$ 10.000.000,00 | 16 | 0.8% | |
| ACIMA DE 1 BILHAO DE REAIS | 16 | 0.8% | |
| DE R$ 10.000.000,01 A R$ 30.000.000,00 | 12 | 0.6% | |
| DE R$ 30.000.000,01 A R$ 100.000.000,00 | 8 | 0.4% | |
| DE R$ 100.000.000,01 A R$ 300.000.000,00 | 6 | 0.3% | |
| DE R$ 500.000.000,01 A 1 BILHAO DE REAIS | 5 | 0.2% | |
| (Missing) | 118 | 5.9% |
| Max length | 40 |
|---|---|
| Mean length | 26.682 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_indicador_telefone
Constant
This variable is constant and should be ignored for analysis
| Constant value | BOA |
|---|
de_natureza_juridica
Categorical
| Distinct count | 33 |
|---|---|
| Unique (%) | 1.7% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| EMPRESARIO INDIVIDUAL | |
|---|---|
| SOCIEDADE EMPRESARIA LIMITADA | |
| ASSOCIACAO PRIVADA | 115 |
| Other values (30) | 183 |
| Value | Count | Frequency (%) | |
| EMPRESARIO INDIVIDUAL | 1306 | 65.3% | |
| SOCIEDADE EMPRESARIA LIMITADA | 396 | 19.8% | |
| ASSOCIACAO PRIVADA | 115 | 5.8% | |
| EMPRESA INDIVIDUAL DE RESPONSABILIDADE LIMITADA DE NATUREZA EMPRESARIA | 68 | 3.4% | |
| ORGAO DE DIRECAO LOCAL DE PARTIDO POLITICO | 24 | 1.2% | |
| CANDIDATO A CARGO POLITICO ELETIVO | 12 | 0.6% | |
| ENTIDADE SINDICAL | 7 | 0.4% | |
| CONDOMINIO EDILICIO | 7 | 0.4% | |
| ORGANIZACAO RELIGIOSA | 7 | 0.4% | |
| SOCIEDADE SIMPLES LIMITADA | 6 | 0.3% | |
| Other values (23) | 52 | 2.6% |
| Max length | 70 |
|---|---|
| Mean length | 24.597 |
| Min length | 9 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_nivel_atividade
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 2.5% |
| Missing (n) | 51 |
| MEDIA | |
|---|---|
| ALTA | |
| BAIXA | |
| (Missing) | 51 |
| Value | Count | Frequency (%) | |
| MEDIA | 933 | 46.7% | |
| ALTA | 680 | 34.0% | |
| BAIXA | 321 | 16.1% | |
| MUITO BAIXA | 15 | 0.8% | |
| (Missing) | 51 | 2.5% |
| Max length | 11 |
|---|---|
| Mean length | 4.654 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_ramo
Categorical
| Distinct count | 31 |
|---|---|
| Unique (%) | 1.6% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| COMERCIO VAREJISTA | |
|---|---|
| SERVICOS DIVERSOS | |
| SERVICOS DE ALOJAMENTO/ALIMENTACAO | 133 |
| Other values (28) |
| Value | Count | Frequency (%) | |
| COMERCIO VAREJISTA | 749 | 37.5% | |
| SERVICOS DIVERSOS | 229 | 11.5% | |
| SERVICOS DE ALOJAMENTO/ALIMENTACAO | 133 | 6.7% | |
| INDUSTRIA DA CONSTRUCAO | 122 | 6.1% | |
| COMERCIO E REPARACAO DE VEICULOS | 115 | 5.8% | |
| SERVICOS ADMINISTRATIVOS | 102 | 5.1% | |
| BENS DE CONSUMO | 85 | 4.2% | |
| SERVICOS PROFISSIONAIS, TECNICOS E CIENTIFICOS | 78 | 3.9% | |
| COMERCIO POR ATACADO | 63 | 3.1% | |
| TRANSPORTE, ARMAZENAGEM E CORREIO | 59 | 2.9% | |
| Other values (21) | 265 | 13.2% |
| Max length | 49 |
|---|---|
| Mean length | 22.175 |
| Min length | 6 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_saude_rescencia
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 3.2% |
| Missing (n) | 64 |
| ACIMA DE 1 ANO | |
|---|---|
| ATE 1 ANO | 149 |
| SEM INFORMACAO | 135 |
| (Missing) | 64 |
| Value | Count | Frequency (%) | |
| ACIMA DE 1 ANO | 1652 | 82.6% | |
| ATE 1 ANO | 149 | 7.4% | |
| SEM INFORMACAO | 135 | 6.8% | |
| (Missing) | 64 | 3.2% |
| Max length | 14 |
|---|---|
| Mean length | 13.275 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_saude_tributaria
Categorical
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 3.2% |
| Missing (n) | 64 |
| VERDE | |
|---|---|
| AZUL | |
| AMARELO | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| VERDE | 679 | 34.0% | |
| AZUL | 471 | 23.5% | |
| AMARELO | 351 | 17.5% | |
| CINZA | 262 | 13.1% | |
| LARANJA | 150 | 7.5% | |
| VERMELHO | 23 | 1.1% | |
| (Missing) | 64 | 3.2% |
| Max length | 8 |
|---|---|
| Mean length | 5.236 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
dt_situacao
Categorical
| Distinct count | 1315 |
|---|---|
| Unique (%) | 65.8% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 2005-11-03 | 285 |
|---|---|
| 2006-12-01 | 13 |
| 2005-09-24 | 8 |
| Other values (1312) |
| Value | Count | Frequency (%) | |
| 2005-11-03 | 285 | 14.2% | |
| 2006-12-01 | 13 | 0.7% | |
| 2005-09-24 | 8 | 0.4% | |
| 2018-08-14 | 7 | 0.4% | |
| 2004-10-30 | 7 | 0.4% | |
| 2006-12-21 | 7 | 0.4% | |
| 2005-08-27 | 7 | 0.4% | |
| 2006-12-02 | 7 | 0.4% | |
| 2010-05-15 | 6 | 0.3% | |
| 1998-07-28 | 6 | 0.3% | |
| Other values (1305) | 1647 | 82.3% |
| Max length | 10 |
|---|---|
| Mean length | 10 |
| Min length | 10 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
empsetorcensitariofaixarendapopulacao
Numeric
| Distinct count | 1231 |
|---|---|
| Unique (%) | 61.6% |
| Missing (%) | 30.0% |
| Missing (n) | 600 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1342.2 |
|---|---|
| Minimum | 169.28 |
| Maximum | 30862 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 169.28 |
|---|---|
| 5-th percentile | 449.45 |
| Q1 | 686.73 |
| Median | 943.72 |
| Q3 | 1539 |
| 95-th percentile | 3653.9 |
| Maximum | 30862 |
| Range | 30693 |
| Interquartile range | 852.3 |
Descriptive statistics
| Standard deviation | 1340.3 |
|---|---|
| Coef of variation | 0.99859 |
| Kurtosis | 173.63 |
| Mean | 1342.2 |
| MAD | 752.91 |
| Skewness | 9.2164 |
| Sum | 1.8791e+06 |
| Variance | 1.7965e+06 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1549.1 | 9 | 0.4% | |
| 1086 | 6 | 0.3% | |
| 845.75 | 4 | 0.2% | |
| 2522.8 | 4 | 0.2% | |
| 1375.7 | 4 | 0.2% | |
| 1116.5 | 4 | 0.2% | |
| 786.74 | 3 | 0.1% | |
| 452.13 | 3 | 0.1% | |
| 1939.1 | 3 | 0.1% | |
| 1519.6 | 3 | 0.1% | |
| Other values (1220) | 1357 | 67.8% | |
| (Missing) | 600 | 30.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 169.28 | 1 | 0.1% | |
| 205.66 | 1 | 0.1% | |
| 247.88 | 1 | 0.1% | |
| 255.93 | 1 | 0.1% | |
| 262.44 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 30862 | 1 | 0.1% | |
| 12513 | 1 | 0.1% | |
| 10039 | 1 | 0.1% | |
| 7716.2 | 1 | 0.1% | |
| 6641.4 | 1 | 0.1% |
faturamento_est_coligados
Numeric
| Distinct count | 135 |
|---|---|
| Unique (%) | 6.8% |
| Missing (%) | 86.9% |
| Missing (n) | 1738 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 3.251e+08 |
|---|---|
| Minimum | 50000 |
| Maximum | 2.7759e+10 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 50000 |
|---|---|
| 5-th percentile | 1.8668e+05 |
| Q1 | 2.1e+05 |
| Median | 6.8001e+05 |
| Q3 | 2.515e+06 |
| 95-th percentile | 1.5139e+08 |
| Maximum | 2.7759e+10 |
| Range | 2.7759e+10 |
| Interquartile range | 2.305e+06 |
Descriptive statistics
| Standard deviation | 2.6192e+09 |
|---|---|
| Coef of variation | 8.0565 |
| Kurtosis | 93.975 |
| Mean | 3.251e+08 |
| MAD | 6.1382e+08 |
| Skewness | 9.5289 |
| Sum | 8.5176e+10 |
| Variance | 6.86e+18 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 2.1e+05 | 68 | 3.4% | |
| 4.2e+05 | 14 | 0.7% | |
| 9.3e+05 | 11 | 0.5% | |
| 6.3e+05 | 7 | 0.4% | |
| 1.8546e+05 | 6 | 0.3% | |
| 3.7092e+05 | 5 | 0.2% | |
| 50000 | 5 | 0.2% | |
| 9.8911e+05 | 3 | 0.1% | |
| 1.2364e+05 | 3 | 0.1% | |
| 1.26e+06 | 3 | 0.1% | |
| Other values (124) | 137 | 6.9% | |
| (Missing) | 1738 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 50000 | 5 | 0.2% | |
| 1.2364e+05 | 3 | 0.1% | |
| 1.8546e+05 | 6 | 0.3% | |
| 2.1e+05 | 68 | 3.4% | |
| 2.4728e+05 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2.7759e+10 | 1 | 0.1% | |
| 2.7536e+10 | 1 | 0.1% | |
| 1.5086e+10 | 1 | 0.1% | |
| 7.2571e+09 | 1 | 0.1% | |
| 2.1435e+09 | 1 | 0.1% |
faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.95095 |
|---|
fl_antt
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| False | |
|---|---|
| True | 6 |
| (Missing) | 11 |
| Value | Count | Frequency (%) | |
| False | 1983 | 99.2% | |
| True | 6 | 0.3% | |
| (Missing) | 11 | 0.5% |
fl_email
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) | |
| False | 1071 | 53.5% | |
| True | 929 | 46.5% |
fl_epp
Constant
This variable is constant and should be ignored for analysis
| Constant value | False |
|---|
fl_ltda
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 3 |
| Value | Count | Frequency (%) | |
| False | 1997 | 99.9% | |
| True | 3 | 0.1% |
fl_matriz
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| True | |
|---|---|
| False | 107 |
| Value | Count | Frequency (%) | |
| True | 1893 | 94.7% | |
| False | 107 | 5.3% |
fl_me
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 4 |
| Value | Count | Frequency (%) | |
| False | 1996 | 99.8% | |
| True | 4 | 0.2% |
fl_mei
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) | |
| False | 1339 | 67.0% | |
| True | 661 | 33.1% |
fl_optante_simei
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 17.3% |
| Missing (n) | 346 |
| False | |
|---|---|
| True | |
| (Missing) |
| Value | Count | Frequency (%) | |
| False | 1233 | 61.7% | |
| True | 421 | 21.1% | |
| (Missing) | 346 | 17.3% |
fl_optante_simples
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 17.3% |
| Missing (n) | 346 |
| True | |
|---|---|
| False | |
| (Missing) |
| Value | Count | Frequency (%) | |
| True | 913 | 45.6% | |
| False | 741 | 37.0% | |
| (Missing) | 346 | 17.3% |
fl_passivel_iss
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| True | |
|---|---|
| False | |
| (Missing) | 11 |
| Value | Count | Frequency (%) | |
| True | 1141 | 57.0% | |
| False | 848 | 42.4% | |
| (Missing) | 11 | 0.5% |
fl_rm
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| NAO | |
|---|---|
| SIM |
| Value | Count | Frequency (%) | |
| NAO | 1008 | 50.4% | |
| SIM | 992 | 49.6% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
fl_sa
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 36 |
| Value | Count | Frequency (%) | |
| False | 1964 | 98.2% | |
| True | 36 | 1.8% |
fl_simples_irregular
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| False | |
|---|---|
| True | 2 |
| (Missing) | 11 |
| Value | Count | Frequency (%) | |
| False | 1987 | 99.4% | |
| True | 2 | 0.1% | |
| (Missing) | 11 | 0.5% |
fl_spa
Constant
This variable is constant and should be ignored for analysis
| Constant value | False |
|---|
fl_st_especial
Constant
This variable is constant and should be ignored for analysis
| Constant value | False |
|---|
fl_telefone
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) | |
| True | 1459 | 73.0% | |
| False | 541 | 27.1% |
fl_veiculo
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| False | |
|---|---|
| True | 136 |
| (Missing) | 11 |
| Value | Count | Frequency (%) | |
| False | 1853 | 92.7% | |
| True | 136 | 6.8% | |
| (Missing) | 11 | 0.5% |
grau_instrucao_macro_analfabeto
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 99.7% |
| Missing (n) | 1994 |
| 1 | 5 |
|---|---|
| 3 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 5 | 0.2% | |
| 3 | 1 | 0.1% | |
| (Missing) | 1994 | 99.7% |
grau_instrucao_macro_desconhecido
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
grau_instrucao_macro_escolaridade_fundamental
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.99804 |
|---|
grau_instrucao_macro_escolaridade_media
Numeric
| Distinct count | 37 |
|---|---|
| Unique (%) | 1.8% |
| Missing (%) | 84.7% |
| Missing (n) | 1694 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 8.6373 |
|---|---|
| Minimum | 1 |
| Maximum | 523 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 3 |
| Q3 | 6 |
| 95-th percentile | 24.5 |
| Maximum | 523 |
| Range | 522 |
| Interquartile range | 5 |
Descriptive statistics
| Standard deviation | 33.488 |
|---|---|
| Coef of variation | 3.8772 |
| Kurtosis | 185.92 |
| Mean | 8.6373 |
| MAD | 9.6794 |
| Skewness | 12.636 |
| Sum | 2643 |
| Variance | 1121.5 |
| Memory size | 31.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 89 | 4.5% | |
| 2 | 56 | 2.8% | |
| 3 | 34 | 1.7% | |
| 4 | 25 | 1.2% | |
| 5 | 17 | 0.9% | |
| 6 | 17 | 0.9% | |
| 9 | 7 | 0.4% | |
| 7 | 7 | 0.4% | |
| 8 | 6 | 0.3% | |
| 10 | 6 | 0.3% | |
| Other values (26) | 42 | 2.1% | |
| (Missing) | 1694 | 84.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 89 | 4.5% | |
| 2 | 56 | 2.8% | |
| 3 | 34 | 1.7% | |
| 4 | 25 | 1.2% | |
| 5 | 17 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 523 | 1 | 0.1% | |
| 177 | 1 | 0.1% | |
| 120 | 1 | 0.1% | |
| 94 | 1 | 0.1% | |
| 82 | 1 | 0.1% |
grau_instrucao_macro_escolaridade_superior
Highly correlated
This variable is highly correlated with grau_instrucao_macro_escolaridade_media and should be ignored for analysis
| Correlation | 0.91788 |
|---|
id
Categorical, Unique
| First 5 values |
|---|
| 00123b6e449556823ba4aac6dbb35b44f60557c511566f838dc889b75c6f9af1 |
| 0032e3e6a776cbf4d36efa963b4eda224ddba8af284117273bbd7a2a9d374f96 |
| 0036afe9a1be13d5389ab0d8f8cd2217c8c6076345298cb451f3634f44294ae0 |
| 008fd7836462aaecf8b8d335153ada057577156875fae4933a9fc16a2a44e0d0 |
| 00c30fb1779762241dd1ca7d1baab00c24b2f87294e573415ccfa23fda43c270 |
| Last 5 values |
|---|
| ffba50aa6f4af7271d8f9738018b0f0329c89b05fe597100f833a07065a2c417 |
| ffc04bac625e91c7ebad8dc98943264661aca3a913b5f634c2ae69a947772e13 |
| ffd37fc0ee78a4555d0c18e10b3779e54581142875f0ad31f9ba0406fd5a9b2d |
| ffd81231de150c50400d79d1740ca8108eda5b339beb850646cdd9424bae405e |
| ffed1e47eaf7b3444605cd7cb91bf9ef7cf3bbe9f7f73092c10d21a1d454d1fd |
First 5 values
| Value | Count | Frequency (%) | |
| 00123b6e449556823ba4aac6dbb35b44f60557c511566f838dc889b75c6f9af1 | 1 | 0.1% | |
| 0032e3e6a776cbf4d36efa963b4eda224ddba8af284117273bbd7a2a9d374f96 | 1 | 0.1% | |
| 0036afe9a1be13d5389ab0d8f8cd2217c8c6076345298cb451f3634f44294ae0 | 1 | 0.1% | |
| 008fd7836462aaecf8b8d335153ada057577156875fae4933a9fc16a2a44e0d0 | 1 | 0.1% | |
| 00c30fb1779762241dd1ca7d1baab00c24b2f87294e573415ccfa23fda43c270 | 1 | 0.1% |
Last 5 values
| Value | Count | Frequency (%) | |
| ffed1e47eaf7b3444605cd7cb91bf9ef7cf3bbe9f7f73092c10d21a1d454d1fd | 1 | 0.1% | |
| ffd81231de150c50400d79d1740ca8108eda5b339beb850646cdd9424bae405e | 1 | 0.1% | |
| ffd37fc0ee78a4555d0c18e10b3779e54581142875f0ad31f9ba0406fd5a9b2d | 1 | 0.1% | |
| ffc04bac625e91c7ebad8dc98943264661aca3a913b5f634c2ae69a947772e13 | 1 | 0.1% | |
| ffba50aa6f4af7271d8f9738018b0f0329c89b05fe597100f833a07065a2c417 | 1 | 0.1% |
idade_acima_de_58
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 1 |
|---|
idade_ate_18
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 99.5% |
| Missing (n) | 1990 |
| 1 | 7 |
|---|---|
| 3 | 2 |
| 2 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 7 | 0.4% | |
| 3 | 2 | 0.1% | |
| 2 | 1 | 0.1% | |
| (Missing) | 1990 | 99.5% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
idade_de_19_a_23
Numeric
| Distinct count | 13 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 94.0% |
| Missing (n) | 1879 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.4628 |
|---|---|
| Minimum | 1 |
| Maximum | 43 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 6 |
| Maximum | 43 |
| Range | 42 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 4.4479 |
|---|---|
| Coef of variation | 1.806 |
| Kurtosis | 58.704 |
| Mean | 2.4628 |
| MAD | 1.9587 |
| Skewness | 6.9266 |
| Sum | 298 |
| Variance | 19.784 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 75 | 3.8% | |
| 2 | 19 | 0.9% | |
| 3 | 10 | 0.5% | |
| 4 | 6 | 0.3% | |
| 5 | 4 | 0.2% | |
| 11 | 1 | 0.1% | |
| 12 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| 10 | 1 | 0.1% | |
| 6 | 1 | 0.1% | |
| Other values (2) | 2 | 0.1% | |
| (Missing) | 1879 | 94.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 75 | 3.8% | |
| 2 | 19 | 0.9% | |
| 3 | 10 | 0.5% | |
| 4 | 6 | 0.3% | |
| 5 | 4 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 43 | 1 | 0.1% | |
| 15 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| 12 | 1 | 0.1% | |
| 11 | 1 | 0.1% |
idade_de_24_a_28
Highly correlated
This variable is highly correlated with idade_de_19_a_23 and should be ignored for analysis
| Correlation | 0.90867 |
|---|
idade_de_29_a_33
Highly correlated
This variable is highly correlated with grau_instrucao_macro_escolaridade_superior and should be ignored for analysis
| Correlation | 0.97492 |
|---|
idade_de_34_a_38
Highly correlated
This variable is highly correlated with idade_de_29_a_33 and should be ignored for analysis
| Correlation | 0.99405 |
|---|
idade_de_39_a_43
Highly correlated
This variable is highly correlated with idade_de_34_a_38 and should be ignored for analysis
| Correlation | 0.98349 |
|---|
idade_de_44_a_48
Highly correlated
This variable is highly correlated with idade_de_39_a_43 and should be ignored for analysis
| Correlation | 0.94917 |
|---|
idade_de_49_a_53
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.98916 |
|---|
idade_de_54_a_58
Highly correlated
This variable is highly correlated with idade_de_49_a_53 and should be ignored for analysis
| Correlation | 0.96175 |
|---|
idade_emp_cat
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 1 a 5 | |
|---|---|
| 5 a 10 | |
| > 20 | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| 1 a 5 | 609 | 30.4% | |
| 5 a 10 | 523 | 26.2% | |
| > 20 | 329 | 16.4% | |
| 10 a 15 | 196 | 9.8% | |
| <= 1 | 195 | 9.8% | |
| 15 a 20 | 148 | 7.4% |
| Max length | 7 |
|---|---|
| Mean length | 5.3435 |
| Min length | 4 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
idade_empresa_anos
Numeric
| Distinct count | 1669 |
|---|---|
| Unique (%) | 83.5% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 9.9455 |
|---|---|
| Minimum | 0.030137 |
| Maximum | 52.115 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.030137 |
|---|---|
| 5-th percentile | 0.47123 |
| Q1 | 2.7007 |
| Median | 6.6466 |
| Q3 | 14.391 |
| 95-th percentile | 30.719 |
| Maximum | 52.115 |
| Range | 52.085 |
| Interquartile range | 11.69 |
Descriptive statistics
| Standard deviation | 9.6906 |
|---|---|
| Coef of variation | 0.97438 |
| Kurtosis | 1.3867 |
| Mean | 9.9455 |
| MAD | 7.6071 |
| Skewness | 1.3767 |
| Sum | 19891 |
| Variance | 93.909 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0.20548 | 7 | 0.4% | |
| 0.20822 | 5 | 0.2% | |
| 0.60548 | 4 | 0.2% | |
| 1.2548 | 4 | 0.2% | |
| 2.3945 | 4 | 0.2% | |
| 3.4603 | 4 | 0.2% | |
| 1.0274 | 4 | 0.2% | |
| 2.2164 | 4 | 0.2% | |
| 3.0603 | 4 | 0.2% | |
| 5.1945 | 4 | 0.2% | |
| Other values (1659) | 1956 | 97.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.030137 | 2 | 0.1% | |
| 0.046575 | 1 | 0.1% | |
| 0.049315 | 1 | 0.1% | |
| 0.052055 | 1 | 0.1% | |
| 0.054795 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 52.115 | 1 | 0.1% | |
| 51.153 | 1 | 0.1% | |
| 49.438 | 1 | 0.1% | |
| 48.279 | 1 | 0.1% | |
| 47.326 | 1 | 0.1% |
idade_maxima_coligadas
Highly correlated
This variable is highly correlated with coligada_mais_antiga_ativa and should be ignored for analysis
| Correlation | 0.99976 |
|---|
idade_maxima_socios
Numeric
| Distinct count | 73 |
|---|---|
| Unique (%) | 3.6% |
| Missing (%) | 33.0% |
| Missing (n) | 659 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 44.687 |
|---|---|
| Minimum | 18 |
| Maximum | 96 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 25 |
| Q1 | 34 |
| Median | 43 |
| Q3 | 54 |
| 95-th percentile | 70 |
| Maximum | 96 |
| Range | 78 |
| Interquartile range | 20 |
Descriptive statistics
| Standard deviation | 13.941 |
|---|---|
| Coef of variation | 0.31197 |
| Kurtosis | -0.071414 |
| Mean | 44.687 |
| MAD | 11.359 |
| Skewness | 0.54313 |
| Sum | 59925 |
| Variance | 194.35 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 41 | 46 | 2.3% | |
| 35 | 43 | 2.1% | |
| 30 | 40 | 2.0% | |
| 32 | 40 | 2.0% | |
| 40 | 39 | 1.9% | |
| 45 | 38 | 1.9% | |
| 47 | 38 | 1.9% | |
| 48 | 37 | 1.8% | |
| 39 | 37 | 1.8% | |
| 33 | 37 | 1.8% | |
| Other values (62) | 946 | 47.3% | |
| (Missing) | 659 | 33.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 18 | 1 | 0.1% | |
| 19 | 4 | 0.2% | |
| 20 | 7 | 0.4% | |
| 21 | 7 | 0.4% | |
| 22 | 14 | 0.7% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 96 | 1 | 0.1% | |
| 90 | 1 | 0.1% | |
| 87 | 2 | 0.1% | |
| 86 | 2 | 0.1% | |
| 85 | 1 | 0.1% |
idade_media_coligadas
Numeric
| Distinct count | 262 |
|---|---|
| Unique (%) | 13.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 159.21 |
|---|---|
| Minimum | 1.7667 |
| Maximum | 634.4 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.7667 |
|---|---|
| 5-th percentile | 37.777 |
| Q1 | 88.3 |
| Median | 148.63 |
| Q3 | 213.65 |
| 95-th percentile | 333.71 |
| Maximum | 634.4 |
| Range | 632.63 |
| Interquartile range | 125.35 |
Descriptive statistics
| Standard deviation | 92.859 |
|---|---|
| Coef of variation | 0.58325 |
| Kurtosis | 2.2092 |
| Mean | 159.21 |
| MAD | 72.529 |
| Skewness | 1.0645 |
| Sum | 41872 |
| Variance | 8622.9 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 243.7 | 2 | 0.1% | |
| 87.644 | 2 | 0.1% | |
| 262.6 | 1 | 0.1% | |
| 85.85 | 1 | 0.1% | |
| 137.88 | 1 | 0.1% | |
| 23.767 | 1 | 0.1% | |
| 154.15 | 1 | 0.1% | |
| 121.41 | 1 | 0.1% | |
| 47.233 | 1 | 0.1% | |
| 16.933 | 1 | 0.1% | |
| Other values (251) | 251 | 12.6% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.7667 | 1 | 0.1% | |
| 5.2667 | 1 | 0.1% | |
| 11.267 | 1 | 0.1% | |
| 16.933 | 1 | 0.1% | |
| 23.767 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 634.4 | 1 | 0.1% | |
| 445.12 | 1 | 0.1% | |
| 407.73 | 1 | 0.1% | |
| 395.33 | 1 | 0.1% | |
| 392.85 | 1 | 0.1% |
idade_media_coligadas_ativas
Highly correlated
This variable is highly correlated with idade_media_coligadas and should be ignored for analysis
| Correlation | 0.99964 |
|---|
idade_media_coligadas_baixadas
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
idade_media_socios
Highly correlated
This variable is highly correlated with idade_maxima_socios and should be ignored for analysis
| Correlation | 0.95544 |
|---|
idade_minima_coligadas
Highly correlated
This variable is highly correlated with coligada_mais_nova_ativa and should be ignored for analysis
| Correlation | 1 |
|---|
idade_minima_socios
Highly correlated
This variable is highly correlated with idade_media_socios and should be ignored for analysis
| Correlation | 0.95058 |
|---|
max_faturamento_est_coligados
Highly correlated
This variable is highly correlated with faturamento_est_coligados_gp and should be ignored for analysis
| Correlation | 0.93599 |
|---|
max_faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with max_faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.95994 |
|---|
max_filiais_coligados
Numeric
| Distinct count | 22 |
|---|---|
| Unique (%) | 1.1% |
| Missing (%) | 96.0% |
| Missing (n) | 1919 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 12.383 |
|---|---|
| Minimum | 1 |
| Maximum | 349 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 5 |
| 95-th percentile | 46 |
| Maximum | 349 |
| Range | 348 |
| Interquartile range | 4 |
Descriptive statistics
| Standard deviation | 42.62 |
|---|---|
| Coef of variation | 3.4419 |
| Kurtosis | 50.596 |
| Mean | 12.383 |
| MAD | 17.062 |
| Skewness | 6.7275 |
| Sum | 1003 |
| Variance | 1816.5 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 37 | 1.8% | |
| 2 | 16 | 0.8% | |
| 3 | 4 | 0.2% | |
| 5 | 3 | 0.1% | |
| 8 | 2 | 0.1% | |
| 6 | 2 | 0.1% | |
| 21 | 2 | 0.1% | |
| 4 | 2 | 0.1% | |
| 37 | 1 | 0.1% | |
| 7 | 1 | 0.1% | |
| Other values (11) | 11 | 0.5% | |
| (Missing) | 1919 | 96.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 37 | 1.8% | |
| 2 | 16 | 0.8% | |
| 3 | 4 | 0.2% | |
| 4 | 2 | 0.1% | |
| 5 | 3 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 349 | 1 | 0.1% | |
| 144 | 1 | 0.1% | |
| 67 | 1 | 0.1% | |
| 49 | 1 | 0.1% | |
| 46 | 1 | 0.1% |
max_funcionarios_coligados_gp
Numeric
| Distinct count | 61 |
|---|---|
| Unique (%) | 3.0% |
| Missing (%) | 91.5% |
| Missing (n) | 1830 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 277.26 |
|---|---|
| Minimum | 0 |
| Maximum | 13234 |
| Zeros (%) | 1.4% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| Median | 7 |
| Q3 | 21 |
| 95-th percentile | 917.5 |
| Maximum | 13234 |
| Range | 13234 |
| Interquartile range | 19 |
Descriptive statistics
| Standard deviation | 1358.5 |
|---|---|
| Coef of variation | 4.8996 |
| Kurtosis | 58.608 |
| Mean | 277.26 |
| MAD | 469.54 |
| Skewness | 7.2586 |
| Sum | 47135 |
| Variance | 1.8454e+06 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 28 | 1.4% | |
| 1 | 14 | 0.7% | |
| 8 | 10 | 0.5% | |
| 3 | 9 | 0.4% | |
| 6 | 9 | 0.4% | |
| 2 | 8 | 0.4% | |
| 4 | 7 | 0.4% | |
| 5 | 7 | 0.4% | |
| 7 | 5 | 0.2% | |
| 12 | 4 | 0.2% | |
| Other values (50) | 69 | 3.5% | |
| (Missing) | 1830 | 91.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 28 | 1.4% | |
| 1 | 14 | 0.7% | |
| 2 | 8 | 0.4% | |
| 3 | 9 | 0.4% | |
| 4 | 7 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 13234 | 1 | 0.1% | |
| 7646 | 1 | 0.1% | |
| 7612 | 1 | 0.1% | |
| 3719 | 1 | 0.1% | |
| 3278 | 1 | 0.1% |
max_meses_servicos
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.96649 |
|---|
max_meses_servicos_all
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.96649 |
|---|
max_vl_folha_coligados
Numeric
| Distinct count | 80 |
|---|---|
| Unique (%) | 4.0% |
| Missing (%) | 92.9% |
| Missing (n) | 1858 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.776e+07 |
|---|---|
| Minimum | 20606 |
| Maximum | 1.0023e+09 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 20606 |
|---|---|
| 5-th percentile | 61819 |
| Q1 | 1.8546e+05 |
| Median | 5.5637e+05 |
| Q3 | 1.3755e+06 |
| 95-th percentile | 4.9337e+07 |
| Maximum | 1.0023e+09 |
| Range | 1.0023e+09 |
| Interquartile range | 1.19e+06 |
Descriptive statistics
| Standard deviation | 9.4405e+07 |
|---|---|
| Coef of variation | 5.3157 |
| Kurtosis | 87.65 |
| Mean | 1.776e+07 |
| MAD | 2.9605e+07 |
| Skewness | 8.8909 |
| Sum | 2.5219e+09 |
| Variance | 8.9124e+15 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 61819 | 15 | 0.8% | |
| 2.4728e+05 | 7 | 0.4% | |
| 1.2364e+05 | 6 | 0.3% | |
| 1.8546e+05 | 6 | 0.3% | |
| 4.9455e+05 | 4 | 0.2% | |
| 3.297e+05 | 3 | 0.1% | |
| 4.3273e+05 | 3 | 0.1% | |
| 6.1819e+05 | 3 | 0.1% | |
| 3.091e+05 | 3 | 0.1% | |
| 2.0606e+05 | 3 | 0.1% | |
| Other values (69) | 89 | 4.5% | |
| (Missing) | 1858 | 92.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 20606 | 2 | 0.1% | |
| 41213 | 1 | 0.1% | |
| 61819 | 15 | 0.8% | |
| 82426 | 2 | 0.1% | |
| 1.0303e+05 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1.0023e+09 | 1 | 0.1% | |
| 4.4242e+08 | 1 | 0.1% | |
| 1.9471e+08 | 1 | 0.1% | |
| 1.3491e+08 | 1 | 0.1% | |
| 1.3087e+08 | 1 | 0.1% |
max_vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with max_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.90484 |
|---|
media_faturamento_est_coligados
Numeric
| Distinct count | 130 |
|---|---|
| Unique (%) | 6.5% |
| Missing (%) | 86.9% |
| Missing (n) | 1738 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.3598e+07 |
|---|---|
| Minimum | 50000 |
| Maximum | 1.3219e+09 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 50000 |
|---|---|
| 5-th percentile | 1.8243e+05 |
| Q1 | 2.1e+05 |
| Median | 3.1098e+05 |
| Q3 | 9.3e+05 |
| 95-th percentile | 1.4444e+07 |
| Maximum | 1.3219e+09 |
| Range | 1.3218e+09 |
| Interquartile range | 7.2e+05 |
Descriptive statistics
| Standard deviation | 9.7682e+07 |
|---|---|
| Coef of variation | 7.1837 |
| Kurtosis | 129.93 |
| Mean | 1.3598e+07 |
| MAD | 2.4052e+07 |
| Skewness | 10.665 |
| Sum | 3.5626e+09 |
| Variance | 9.5418e+15 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 2.1e+05 | 95 | 4.8% | |
| 9.3e+05 | 12 | 0.6% | |
| 1.8546e+05 | 6 | 0.3% | |
| 3.7092e+05 | 5 | 0.2% | |
| 50000 | 5 | 0.2% | |
| 9.8911e+05 | 3 | 0.1% | |
| 1.6682e+05 | 2 | 0.1% | |
| 2.2864e+05 | 2 | 0.1% | |
| 1.8227e+05 | 2 | 0.1% | |
| 1.2364e+05 | 2 | 0.1% | |
| Other values (119) | 128 | 6.4% | |
| (Missing) | 1738 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 50000 | 5 | 0.2% | |
| 61819 | 1 | 0.1% | |
| 1.2364e+05 | 2 | 0.1% | |
| 1.5374e+05 | 1 | 0.1% | |
| 1.6682e+05 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1.3219e+09 | 1 | 0.1% | |
| 5.1837e+08 | 1 | 0.1% | |
| 5.0287e+08 | 1 | 0.1% | |
| 4.4174e+08 | 1 | 0.1% | |
| 2.3535e+08 | 1 | 0.1% |
media_faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with media_faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.95038 |
|---|
media_filiais_coligados
Numeric
| Distinct count | 30 |
|---|---|
| Unique (%) | 1.5% |
| Missing (%) | 96.0% |
| Missing (n) | 1919 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 4.4307 |
|---|---|
| Minimum | 1 |
| Maximum | 67 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1.5 |
| Q3 | 3 |
| 95-th percentile | 14.5 |
| Maximum | 67 |
| Range | 66 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 9.5355 |
|---|---|
| Coef of variation | 2.1522 |
| Kurtosis | 28.736 |
| Mean | 4.4307 |
| MAD | 4.6058 |
| Skewness | 5.0724 |
| Sum | 358.89 |
| Variance | 90.927 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 37 | 1.8% | |
| 2 | 11 | 0.5% | |
| 1.5 | 4 | 0.2% | |
| 3 | 3 | 0.1% | |
| 4.5 | 2 | 0.1% | |
| 1.6667 | 1 | 0.1% | |
| 14.5 | 1 | 0.1% | |
| 2.5 | 1 | 0.1% | |
| 4 | 1 | 0.1% | |
| 8.5 | 1 | 0.1% | |
| Other values (19) | 19 | 0.9% | |
| (Missing) | 1919 | 96.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 37 | 1.8% | |
| 1.3333 | 1 | 0.1% | |
| 1.5 | 4 | 0.2% | |
| 1.6667 | 1 | 0.1% | |
| 2 | 11 | 0.5% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 67 | 1 | 0.1% | |
| 49.4 | 1 | 0.1% | |
| 20.667 | 1 | 0.1% | |
| 18 | 1 | 0.1% | |
| 14.5 | 1 | 0.1% |
media_funcionarios_coligados_gp
Numeric
| Distinct count | 71 |
|---|---|
| Unique (%) | 3.5% |
| Missing (%) | 91.5% |
| Missing (n) | 1830 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 53.771 |
|---|---|
| Minimum | 0 |
| Maximum | 2162.4 |
| Zeros (%) | 1.4% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.25 |
| Median | 6 |
| Q3 | 16 |
| 95-th percentile | 195.88 |
| Maximum | 2162.4 |
| Range | 2162.4 |
| Interquartile range | 14.75 |
Descriptive statistics
| Standard deviation | 213.88 |
|---|---|
| Coef of variation | 3.9776 |
| Kurtosis | 63.035 |
| Mean | 53.771 |
| MAD | 80.962 |
| Skewness | 7.3682 |
| Sum | 9141 |
| Variance | 45743 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 28 | 1.4% | |
| 1 | 14 | 0.7% | |
| 6 | 10 | 0.5% | |
| 2 | 10 | 0.5% | |
| 3 | 9 | 0.4% | |
| 8 | 8 | 0.4% | |
| 5 | 7 | 0.4% | |
| 4 | 6 | 0.3% | |
| 21 | 6 | 0.3% | |
| 7 | 5 | 0.2% | |
| Other values (60) | 67 | 3.4% | |
| (Missing) | 1830 | 91.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 28 | 1.4% | |
| 0.66667 | 1 | 0.1% | |
| 1 | 14 | 0.7% | |
| 2 | 10 | 0.5% | |
| 3 | 9 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2162.4 | 1 | 0.1% | |
| 1254.8 | 1 | 0.1% | |
| 931 | 1 | 0.1% | |
| 546 | 1 | 0.1% | |
| 479.83 | 1 | 0.1% |
media_meses_servicos
Highly correlated
This variable is highly correlated with max_meses_servicos and should be ignored for analysis
| Correlation | 0.98685 |
|---|
media_meses_servicos_all
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.97656 |
|---|
media_vl_folha_coligados
Numeric
| Distinct count | 96 |
|---|---|
| Unique (%) | 4.8% |
| Missing (%) | 92.9% |
| Missing (n) | 1858 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 4.1188e+06 |
|---|---|
| Minimum | 20606 |
| Maximum | 1.3491e+08 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 20606 |
|---|---|
| 5-th percentile | 61819 |
| Q1 | 1.8546e+05 |
| Median | 4.4304e+05 |
| Q3 | 1.1334e+06 |
| 95-th percentile | 1.1696e+07 |
| Maximum | 1.3491e+08 |
| Range | 1.3489e+08 |
| Interquartile range | 9.4789e+05 |
Descriptive statistics
| Standard deviation | 1.6323e+07 |
|---|---|
| Coef of variation | 3.9631 |
| Kurtosis | 45.659 |
| Mean | 4.1188e+06 |
| MAD | 6.1598e+06 |
| Skewness | 6.5235 |
| Sum | 5.8487e+08 |
| Variance | 2.6646e+14 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 61819 | 15 | 0.8% | |
| 1.8546e+05 | 6 | 0.3% | |
| 1.2364e+05 | 5 | 0.2% | |
| 2.4728e+05 | 4 | 0.2% | |
| 3.297e+05 | 3 | 0.1% | |
| 3.091e+05 | 3 | 0.1% | |
| 2.0606e+05 | 3 | 0.1% | |
| 4.9455e+05 | 3 | 0.1% | |
| 3.7092e+05 | 3 | 0.1% | |
| 1.0509e+06 | 2 | 0.1% | |
| Other values (85) | 95 | 4.8% | |
| (Missing) | 1858 | 92.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 20606 | 2 | 0.1% | |
| 41213 | 1 | 0.1% | |
| 61819 | 15 | 0.8% | |
| 61819 | 1 | 0.1% | |
| 82426 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1.3491e+08 | 1 | 0.1% | |
| 1.163e+08 | 1 | 0.1% | |
| 6.5405e+07 | 1 | 0.1% | |
| 4.1406e+07 | 1 | 0.1% | |
| 2.819e+07 | 1 | 0.1% |
media_vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with media_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.90412 |
|---|
meses_ultima_contratacaco
Numeric
| Distinct count | 254 |
|---|---|
| Unique (%) | 12.7% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 40.529 |
|---|---|
| Minimum | 2.0667 |
| Maximum | 325.53 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 2.0667 |
|---|---|
| 5-th percentile | 3.2533 |
| Q1 | 10 |
| Median | 32.733 |
| Q3 | 58.167 |
| 95-th percentile | 106.22 |
| Maximum | 325.53 |
| Range | 323.47 |
| Interquartile range | 48.167 |
Descriptive statistics
| Standard deviation | 37.497 |
|---|---|
| Coef of variation | 0.92518 |
| Kurtosis | 8.3717 |
| Mean | 40.529 |
| MAD | 27.838 |
| Skewness | 2.0336 |
| Sum | 18927 |
| Variance | 1406 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 5.9667 | 13 | 0.7% | |
| 2.9333 | 10 | 0.5% | |
| 40.5 | 9 | 0.4% | |
| 8.0333 | 8 | 0.4% | |
| 26.233 | 7 | 0.4% | |
| 6.9667 | 7 | 0.4% | |
| 39.467 | 7 | 0.4% | |
| 7 | 6 | 0.3% | |
| 23.2 | 6 | 0.3% | |
| 24.2 | 6 | 0.3% | |
| Other values (243) | 388 | 19.4% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 2.0667 | 1 | 0.1% | |
| 2.1333 | 1 | 0.1% | |
| 2.1667 | 1 | 0.1% | |
| 2.3 | 1 | 0.1% | |
| 2.4 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 325.53 | 1 | 0.1% | |
| 222.07 | 1 | 0.1% | |
| 178.4 | 1 | 0.1% | |
| 168.2 | 1 | 0.1% | |
| 167.23 | 1 | 0.1% |
min_faturamento_est_coligados
Numeric
| Distinct count | 42 |
|---|---|
| Unique (%) | 2.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1738 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 4.4692e+05 |
|---|---|
| Minimum | 13946 |
| Maximum | 1.4466e+07 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 13946 |
|---|---|
| 5-th percentile | 83804 |
| Q1 | 2.1e+05 |
| Median | 2.1e+05 |
| Q3 | 2.1e+05 |
| 95-th percentile | 1.1127e+06 |
| Maximum | 1.4466e+07 |
| Range | 1.4452e+07 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 1.0523e+06 |
|---|---|
| Coef of variation | 2.3546 |
| Kurtosis | 122.78 |
| Mean | 4.4692e+05 |
| MAD | 4.1037e+05 |
| Skewness | 9.8582 |
| Sum | 1.1709e+08 |
| Variance | 1.1074e+12 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 2.1e+05 | 159 | 8.0% | |
| 9.3e+05 | 15 | 0.8% | |
| 1.8546e+05 | 13 | 0.7% | |
| 1.2364e+05 | 13 | 0.7% | |
| 3.7092e+05 | 6 | 0.3% | |
| 50000 | 5 | 0.2% | |
| 9.8911e+05 | 4 | 0.2% | |
| 2.4728e+05 | 4 | 0.2% | |
| 2.0606e+05 | 3 | 0.1% | |
| 82426 | 2 | 0.1% | |
| Other values (31) | 38 | 1.9% | |
| (Missing) | 1738 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 13946 | 1 | 0.1% | |
| 19079 | 1 | 0.1% | |
| 41213 | 2 | 0.1% | |
| 50000 | 5 | 0.2% | |
| 51516 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1.4466e+07 | 1 | 0.1% | |
| 4.657e+06 | 1 | 0.1% | |
| 3.7916e+06 | 1 | 0.1% | |
| 3.5855e+06 | 1 | 0.1% | |
| 3.194e+06 | 1 | 0.1% |
min_faturamento_est_coligados_gp
Numeric
| Distinct count | 56 |
|---|---|
| Unique (%) | 2.8% |
| Missing (%) | 86.9% |
| Missing (n) | 1738 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 8.4177e+05 |
|---|---|
| Minimum | 19079 |
| Maximum | 8.1346e+07 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 19079 |
|---|---|
| 5-th percentile | 1.2364e+05 |
| Q1 | 2.1e+05 |
| Median | 2.1e+05 |
| Q3 | 4.2e+05 |
| 95-th percentile | 1.8546e+06 |
| Maximum | 8.1346e+07 |
| Range | 8.1327e+07 |
| Interquartile range | 2.1e+05 |
Descriptive statistics
| Standard deviation | 5.122e+06 |
|---|---|
| Coef of variation | 6.0848 |
| Kurtosis | 236.44 |
| Mean | 8.4177e+05 |
| MAD | 9.9927e+05 |
| Skewness | 15.074 |
| Sum | 2.2054e+08 |
| Variance | 2.6235e+13 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 2.1e+05 | 145 | 7.2% | |
| 9.3e+05 | 14 | 0.7% | |
| 1.2364e+05 | 13 | 0.7% | |
| 1.8546e+05 | 9 | 0.4% | |
| 4.2e+05 | 8 | 0.4% | |
| 3.7092e+05 | 5 | 0.2% | |
| 50000 | 5 | 0.2% | |
| 6.3e+05 | 4 | 0.2% | |
| 2.4728e+05 | 3 | 0.1% | |
| 1.1127e+06 | 3 | 0.1% | |
| Other values (45) | 53 | 2.6% | |
| (Missing) | 1738 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 19079 | 1 | 0.1% | |
| 41213 | 2 | 0.1% | |
| 50000 | 5 | 0.2% | |
| 51516 | 1 | 0.1% | |
| 61819 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 8.1346e+07 | 1 | 0.1% | |
| 1.4676e+07 | 1 | 0.1% | |
| 5.6089e+06 | 1 | 0.1% | |
| 4.657e+06 | 1 | 0.1% | |
| 4.451e+06 | 1 | 0.1% |
min_filiais_coligados
Highly correlated
This variable is highly correlated with min_faturamento_est_coligados_gp and should be ignored for analysis
| Correlation | 0.94222 |
|---|
min_funcionarios_coligados_gp
Highly correlated
This variable is highly correlated with min_filiais_coligados and should be ignored for analysis
| Correlation | 0.90511 |
|---|
min_meses_servicos
Highly correlated
This variable is highly correlated with meses_ultima_contratacaco and should be ignored for analysis
| Correlation | 0.9357 |
|---|
min_meses_servicos_all
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.98306 |
|---|
min_vl_folha_coligados
Numeric
| Distinct count | 47 |
|---|---|
| Unique (%) | 2.4% |
| Missing (%) | 92.9% |
| Missing (n) | 1858 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.4262e+06 |
|---|---|
| Minimum | 0 |
| Maximum | 1.3491e+08 |
| Zeros (%) | 0.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 41213 |
| Q1 | 61819 |
| Median | 2.0606e+05 |
| Q3 | 4.9455e+05 |
| 95-th percentile | 1.4218e+06 |
| Maximum | 1.3491e+08 |
| Range | 1.3491e+08 |
| Interquartile range | 4.3273e+05 |
Descriptive statistics
| Standard deviation | 1.1334e+07 |
|---|---|
| Coef of variation | 7.9473 |
| Kurtosis | 139.3 |
| Mean | 1.4262e+06 |
| MAD | 2.1156e+06 |
| Skewness | 11.753 |
| Sum | 2.0252e+08 |
| Variance | 1.2847e+14 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 61819 | 28 | 1.4% | |
| 1.2364e+05 | 12 | 0.6% | |
| 1.8546e+05 | 8 | 0.4% | |
| 3.297e+05 | 6 | 0.3% | |
| 1.0303e+05 | 5 | 0.2% | |
| 20606 | 5 | 0.2% | |
| 2.0606e+05 | 5 | 0.2% | |
| 4.9455e+05 | 4 | 0.2% | |
| 6.1819e+05 | 4 | 0.2% | |
| 2.4728e+05 | 4 | 0.2% | |
| Other values (36) | 61 | 3.0% | |
| (Missing) | 1858 | 92.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.1% | |
| 20606 | 5 | 0.2% | |
| 41213 | 3 | 0.1% | |
| 61819 | 28 | 1.4% | |
| 82426 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1.3491e+08 | 1 | 0.1% | |
| 1.0365e+07 | 1 | 0.1% | |
| 7.2328e+06 | 1 | 0.1% | |
| 2.3285e+06 | 1 | 0.1% | |
| 1.9164e+06 | 1 | 0.1% |
min_vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with min_vl_folha_coligados and should be ignored for analysis
| Correlation | 0.98176 |
|---|
natureza_juridica_macro
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| OUTROS | |
|---|---|
| ENTIDADES EMPRESARIAIS | |
| ENTIDADES SEM FINS LUCRATIVOS | 137 |
| Other values (3) | 31 |
| Value | Count | Frequency (%) | |
| OUTROS | 1406 | 70.3% | |
| ENTIDADES EMPRESARIAIS | 426 | 21.3% | |
| ENTIDADES SEM FINS LUCRATIVOS | 137 | 6.9% | |
| ADMINISTRACAO PUBLICA | 15 | 0.8% | |
| CARGO POLITICO | 12 | 0.6% | |
| PESSOAS FISICAS | 4 | 0.2% |
| Max length | 29 |
|---|---|
| Mean length | 11.162 |
| Min length | 6 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_divisao
Categorical
| Distinct count | 72 |
|---|---|
| Unique (%) | 3.6% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| COMERCIO VAREJISTA | |
|---|---|
| ATIVIDADES DE ORGANIZACOES ASSOCIATIVAS | 140 |
| ALIMENTACAO | 121 |
| Other values (68) |
| Value | Count | Frequency (%) | |
| COMERCIO VAREJISTA | 749 | 37.5% | |
| ATIVIDADES DE ORGANIZACOES ASSOCIATIVAS | 140 | 7.0% | |
| ALIMENTACAO | 121 | 6.0% | |
| COMERCIO E REPARACAO DE VEICULOS AUTOMOTORES E MOTOCICLETAS | 115 | 5.8% | |
| SERVICOS ESPECIALIZADOS PARA CONSTRUCAO | 73 | 3.6% | |
| COMERCIO POR ATACADO EXCETO VEICULOS AUTOMOTORES E MOTOCICLETAS | 63 | 3.1% | |
| OUTRAS ATIVIDADES DE SERVICOS PESSOAIS | 54 | 2.7% | |
| SERVICOS DE ESCRITORIO DE APOIO ADMINISTRATIVO E OUTROS SERVICOS PRESTADOS PRINCIPALMENTE AS EMPRESAS | 52 | 2.6% | |
| EDUCACAO | 48 | 2.4% | |
| CONSTRUCAO DE EDIFICIOS | 42 | 2.1% | |
| Other values (61) | 532 | 26.6% |
| Max length | 120 |
|---|---|
| Mean length | 33.612 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_meso_regiao
Categorical
| Distinct count | 20 |
|---|---|
| Unique (%) | 1.0% |
| Missing (%) | 13.9% |
| Missing (n) | 278 |
| CENTRO AMAZONENSE | |
|---|---|
| LESTE POTIGUAR | |
| NORTE MARANHENSE | |
| Other values (16) | |
| (Missing) |
| Value | Count | Frequency (%) | |
| CENTRO AMAZONENSE | 319 | 16.0% | |
| LESTE POTIGUAR | 264 | 13.2% | |
| NORTE MARANHENSE | 263 | 13.2% | |
| CENTRO NORTE PIAUIENSE | 165 | 8.2% | |
| OESTE MARANHENSE | 101 | 5.1% | |
| VALE DO ACRE | 83 | 4.2% | |
| OESTE POTIGUAR | 78 | 3.9% | |
| LESTE MARANHENSE | 77 | 3.9% | |
| CENTRO MARANHENSE | 59 | 2.9% | |
| SUDOESTE PIAUIENSE | 59 | 2.9% | |
| Other values (9) | 254 | 12.7% | |
| (Missing) | 278 | 13.9% |
| Max length | 22 |
|---|---|
| Mean length | 14.361 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_micro_regiao
Categorical
| Distinct count | 73 |
|---|---|
| Unique (%) | 3.6% |
| Missing (%) | 13.9% |
| Missing (n) | 278 |
| MANAUS | |
|---|---|
| NATAL | |
| AGLOMERACAO URBANA DE SAO LUIS | 206 |
| Other values (69) | |
| (Missing) |
| Value | Count | Frequency (%) | |
| MANAUS | 256 | 12.8% | |
| NATAL | 216 | 10.8% | |
| AGLOMERACAO URBANA DE SAO LUIS | 206 | 10.3% | |
| TERESINA | 142 | 7.1% | |
| RIO BRANCO | 70 | 3.5% | |
| IMPERATRIZ | 56 | 2.8% | |
| MOSSORO | 39 | 1.9% | |
| MEDIO MEARIM | 35 | 1.8% | |
| CAXIAS | 32 | 1.6% | |
| PINDARE | 31 | 1.6% | |
| Other values (62) | 639 | 31.9% | |
| (Missing) | 278 | 13.9% |
| Max length | 33 |
|---|---|
| Mean length | 11.012 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_segmento
Categorical
| Distinct count | 21 |
|---|---|
| Unique (%) | 1.1% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| COMERCIO; REPARACAO DE VEICULOS AUTOMOTORES E MOTOCICLETAS | |
|---|---|
| OUTRAS ATIVIDADES DE SERVICOS | |
| INDUSTRIAS DE TRANSFORMACAO | 140 |
| Other values (17) |
| Value | Count | Frequency (%) | |
| COMERCIO; REPARACAO DE VEICULOS AUTOMOTORES E MOTOCICLETAS | 927 | 46.4% | |
| OUTRAS ATIVIDADES DE SERVICOS | 223 | 11.2% | |
| INDUSTRIAS DE TRANSFORMACAO | 140 | 7.0% | |
| ALOJAMENTO E ALIMENTACAO | 133 | 6.7% | |
| CONSTRUCAO | 122 | 6.1% | |
| ATIVIDADES ADMINISTRATIVAS E SERVICOS COMPLEMENTARES | 102 | 5.1% | |
| ATIVIDADES PROFISSIONAIS CIENTIFICAS E TECNICAS | 78 | 3.9% | |
| TRANSPORTE ARMAZENAGEM E CORREIO | 59 | 2.9% | |
| SAUDE HUMANA E SERVICOS SOCIAIS | 50 | 2.5% | |
| EDUCACAO | 48 | 2.4% | |
| Other values (10) | 107 | 5.3% |
| Max length | 62 |
|---|---|
| Mean length | 42.663 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nu_meses_rescencia
Numeric
| Distinct count | 26 |
|---|---|
| Unique (%) | 1.3% |
| Missing (%) | 10.0% |
| Missing (n) | 199 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 25.313 |
|---|---|
| Minimum | 7 |
| Maximum | 54 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 22 |
| Median | 23 |
| Q3 | 25 |
| 95-th percentile | 48 |
| Maximum | 54 |
| Range | 47 |
| Interquartile range | 3 |
Descriptive statistics
| Standard deviation | 9.6847 |
|---|---|
| Coef of variation | 0.3826 |
| Kurtosis | 1.8594 |
| Mean | 25.313 |
| MAD | 5.9891 |
| Skewness | 1.2775 |
| Sum | 45588 |
| Variance | 93.794 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 23 | 479 | 23.9% | |
| 22 | 357 | 17.8% | |
| 24 | 197 | 9.8% | |
| 48 | 125 | 6.2% | |
| 25 | 115 | 5.8% | |
| 26 | 113 | 5.7% | |
| 27 | 87 | 4.3% | |
| 21 | 51 | 2.5% | |
| 9 | 48 | 2.4% | |
| 47 | 39 | 1.9% | |
| Other values (15) | 190 | 9.5% | |
| (Missing) | 199 | 10.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 7 | 39 | 1.9% | |
| 8 | 17 | 0.9% | |
| 9 | 48 | 2.4% | |
| 10 | 16 | 0.8% | |
| 11 | 17 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 54 | 10 | 0.5% | |
| 52 | 3 | 0.1% | |
| 50 | 24 | 1.2% | |
| 49 | 20 | 1.0% | |
| 48 | 125 | 6.2% |
percent_func_genero_fem
Highly correlated
This variable is highly correlated with grau_instrucao_macro_analfabeto and should be ignored for analysis
| Correlation | 0.94572 |
|---|
percent_func_genero_masc
Numeric
| Distinct count | 65 |
|---|---|
| Unique (%) | 3.2% |
| Missing (%) | 83.0% |
| Missing (n) | 1659 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 53.985 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros (%) | 4.2% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 12.5 |
| Median | 57.14 |
| Q3 | 100 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range | 87.5 |
Descriptive statistics
| Standard deviation | 39.183 |
|---|---|
| Coef of variation | 0.72581 |
| Kurtosis | -1.4939 |
| Mean | 53.985 |
| MAD | 34.864 |
| Skewness | -0.1815 |
| Sum | 18409 |
| Variance | 1535.3 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 100 | 94 | 4.7% | |
| 0 | 83 | 4.2% | |
| 50 | 28 | 1.4% | |
| 66.67 | 15 | 0.8% | |
| 33.33 | 13 | 0.7% | |
| 75 | 8 | 0.4% | |
| 83.33 | 7 | 0.4% | |
| 60 | 7 | 0.4% | |
| 25 | 7 | 0.4% | |
| 80 | 4 | 0.2% | |
| Other values (54) | 75 | 3.8% | |
| (Missing) | 1659 | 83.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 83 | 4.2% | |
| 11.11 | 2 | 0.1% | |
| 12.5 | 1 | 0.1% | |
| 14.29 | 1 | 0.1% | |
| 16 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 100 | 94 | 4.7% | |
| 98.04 | 1 | 0.1% | |
| 93.33 | 2 | 0.1% | |
| 93.24 | 1 | 0.1% | |
| 92.31 | 1 | 0.1% |
qt_admitidos
Numeric
| Distinct count | 85 |
|---|---|
| Unique (%) | 4.2% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 35.056 |
|---|---|
| Minimum | 1 |
| Maximum | 2000 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| Median | 6 |
| Q3 | 18 |
| 95-th percentile | 147.8 |
| Maximum | 2000 |
| Range | 1999 |
| Interquartile range | 16 |
Descriptive statistics
| Standard deviation | 144.85 |
|---|---|
| Coef of variation | 4.132 |
| Kurtosis | 124.46 |
| Mean | 35.056 |
| MAD | 46.64 |
| Skewness | 10.325 |
| Sum | 16371 |
| Variance | 20981 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 75 | 3.8% | |
| 2 | 55 | 2.8% | |
| 3 | 33 | 1.7% | |
| 4 | 30 | 1.5% | |
| 6 | 22 | 1.1% | |
| 5 | 21 | 1.1% | |
| 8 | 17 | 0.9% | |
| 7 | 17 | 0.9% | |
| 9 | 15 | 0.8% | |
| 17 | 11 | 0.5% | |
| Other values (74) | 171 | 8.6% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 75 | 3.8% | |
| 2 | 55 | 2.8% | |
| 3 | 33 | 1.7% | |
| 4 | 30 | 1.5% | |
| 5 | 21 | 1.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 2000 | 1 | 0.1% | |
| 1812 | 1 | 0.1% | |
| 890 | 1 | 0.1% | |
| 687 | 1 | 0.1% | |
| 647 | 1 | 0.1% |
qt_admitidos_12meses
Numeric
| Distinct count | 22 |
|---|---|
| Unique (%) | 1.1% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.2099 |
|---|---|
| Minimum | 0 |
| Maximum | 65 |
| Zeros (%) | 17.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 1 |
| 95-th percentile | 6 |
| Maximum | 65 |
| Range | 65 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 4.8411 |
|---|---|
| Coef of variation | 4.0014 |
| Kurtosis | 87.575 |
| Mean | 1.2099 |
| MAD | 1.8304 |
| Skewness | 8.3432 |
| Sum | 565 |
| Variance | 23.437 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 342 | 17.1% | |
| 1 | 65 | 3.2% | |
| 2 | 16 | 0.8% | |
| 3 | 10 | 0.5% | |
| 4 | 7 | 0.4% | |
| 6 | 6 | 0.3% | |
| 12 | 4 | 0.2% | |
| 5 | 2 | 0.1% | |
| 7 | 2 | 0.1% | |
| 14 | 2 | 0.1% | |
| Other values (11) | 11 | 0.5% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 342 | 17.1% | |
| 1 | 65 | 3.2% | |
| 2 | 16 | 0.8% | |
| 3 | 10 | 0.5% | |
| 4 | 7 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 65 | 1 | 0.1% | |
| 46 | 1 | 0.1% | |
| 35 | 1 | 0.1% | |
| 28 | 1 | 0.1% | |
| 21 | 1 | 0.1% |
qt_alteracao_socio_180d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_alteracao_socio_365d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_alteracao_socio_90d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_alteracao_socio_total
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_art
Numeric
| Distinct count | 8 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 98.7% |
| Missing (n) | 1974 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.8462 |
|---|---|
| Minimum | 1 |
| Maximum | 14 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 3 |
| 95-th percentile | 10.25 |
| Maximum | 14 |
| Range | 13 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 3.2704 |
|---|---|
| Coef of variation | 1.1491 |
| Kurtosis | 5.8384 |
| Mean | 2.8462 |
| MAD | 2.0947 |
| Skewness | 2.4747 |
| Sum | 74 |
| Variance | 10.695 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 12 | 0.6% | |
| 2 | 6 | 0.3% | |
| 3 | 4 | 0.2% | |
| 14 | 1 | 0.1% | |
| 5 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| (Missing) | 1974 | 98.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 12 | 0.6% | |
| 2 | 6 | 0.3% | |
| 3 | 4 | 0.2% | |
| 5 | 1 | 0.1% | |
| 8 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 14 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| 5 | 1 | 0.1% | |
| 3 | 4 | 0.2% |
qt_coligadas
Numeric
| Distinct count | 14 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 89.5% |
| Missing (n) | 1791 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.2919 |
|---|---|
| Minimum | 1 |
| Maximum | 26 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 7 |
| Maximum | 26 |
| Range | 25 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 2.6883 |
|---|---|
| Coef of variation | 1.173 |
| Kurtosis | 31.346 |
| Mean | 2.2919 |
| MAD | 1.5992 |
| Skewness | 4.5917 |
| Sum | 479 |
| Variance | 7.2269 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 121 | 6.0% | |
| 2 | 37 | 1.8% | |
| 3 | 18 | 0.9% | |
| 5 | 10 | 0.5% | |
| 4 | 6 | 0.3% | |
| 7 | 5 | 0.2% | |
| 6 | 4 | 0.2% | |
| 8 | 3 | 0.1% | |
| 26 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| Other values (3) | 3 | 0.1% | |
| (Missing) | 1791 | 89.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 121 | 6.0% | |
| 2 | 37 | 1.8% | |
| 3 | 18 | 0.9% | |
| 4 | 6 | 0.3% | |
| 5 | 10 | 0.5% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 26 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| 13 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 9 | 1 | 0.1% |
qt_coligados
Highly correlated
This variable is highly correlated with qt_coligadas and should be ignored for analysis
| Correlation | 0.99508 |
|---|
qt_coligados_agropecuaria
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| 0 | 248 |
|---|---|
| 1 | 11 |
| 2 | 3 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 248 | 12.4% | |
| 1 | 11 | 0.5% | |
| 2 | 3 | 0.1% | |
| 4 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_atividade_alto
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_atividade_baixo
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_atividade_inativo
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_atividade_medio
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_atividade_mt_baixo
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_ativo
Highly correlated
This variable is highly correlated with qt_coligados and should be ignored for analysis
| Correlation | 0.99225 |
|---|
qt_coligados_baixada
Highly correlated
This variable is highly correlated with min_vl_folha_coligados_gp and should be ignored for analysis
| Correlation | 0.97761 |
|---|
qt_coligados_ccivil
Numeric
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.35361 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros (%) | 10.6% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.93309 |
|---|---|
| Coef of variation | 2.6387 |
| Kurtosis | 16.15 |
| Mean | 0.35361 |
| MAD | 0.57008 |
| Skewness | 3.7275 |
| Sum | 93 |
| Variance | 0.87066 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 212 | 10.6% | |
| 1 | 30 | 1.5% | |
| 2 | 12 | 0.6% | |
| 3 | 4 | 0.2% | |
| 5 | 3 | 0.1% | |
| 6 | 2 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 212 | 10.6% | |
| 1 | 30 | 1.5% | |
| 2 | 12 | 0.6% | |
| 3 | 4 | 0.2% | |
| 5 | 3 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 6 | 2 | 0.1% | |
| 5 | 3 | 0.1% | |
| 3 | 4 | 0.2% | |
| 2 | 12 | 0.6% | |
| 1 | 30 | 1.5% |
qt_coligados_centro
Numeric
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.087452 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros (%) | 12.8% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.60212 |
|---|---|
| Coef of variation | 6.8851 |
| Kurtosis | 74.317 |
| Mean | 0.087452 |
| MAD | 0.16958 |
| Skewness | 8.314 |
| Sum | 23 |
| Variance | 0.36255 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 255 | 12.8% | |
| 1 | 3 | 0.1% | |
| 3 | 2 | 0.1% | |
| 6 | 2 | 0.1% | |
| 2 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 255 | 12.8% | |
| 1 | 3 | 0.1% | |
| 2 | 1 | 0.1% | |
| 3 | 2 | 0.1% | |
| 6 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 6 | 2 | 0.1% | |
| 3 | 2 | 0.1% | |
| 2 | 1 | 0.1% | |
| 1 | 3 | 0.1% | |
| 0 | 255 | 12.8% |
qt_coligados_comercio
Numeric
| Distinct count | 14 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.9962 |
|---|---|
| Minimum | 0 |
| Maximum | 20 |
| Zeros (%) | 7.2% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 2.0611 |
|---|---|
| Coef of variation | 2.069 |
| Kurtosis | 32.125 |
| Mean | 0.9962 |
| MAD | 1.0909 |
| Skewness | 4.7676 |
| Sum | 262 |
| Variance | 4.2481 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 144 | 7.2% | |
| 1 | 70 | 3.5% | |
| 2 | 27 | 1.4% | |
| 3 | 7 | 0.4% | |
| 8 | 3 | 0.1% | |
| 6 | 3 | 0.1% | |
| 4 | 2 | 0.1% | |
| 5 | 2 | 0.1% | |
| 10 | 1 | 0.1% | |
| 7 | 1 | 0.1% | |
| Other values (3) | 3 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 144 | 7.2% | |
| 1 | 70 | 3.5% | |
| 2 | 27 | 1.4% | |
| 3 | 7 | 0.4% | |
| 4 | 2 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 20 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 10 | 1 | 0.1% | |
| 9 | 1 | 0.1% | |
| 8 | 3 | 0.1% |
qt_coligados_epp
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_exterior
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| 0 | 257 |
|---|---|
| 2 | 3 |
| 1 | 2 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 257 | 12.8% | |
| 2 | 3 | 0.1% | |
| 1 | 2 | 0.1% | |
| 3 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_inapta
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| 0 | 258 |
|---|---|
| 1 | 3 |
| 2 | 2 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 258 | 12.9% | |
| 1 | 3 | 0.1% | |
| 2 | 2 | 0.1% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_industria
Numeric
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.64259 |
|---|---|
| Minimum | 0 |
| Maximum | 111 |
| Zeros (%) | 11.5% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 111 |
| Range | 111 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 6.9029 |
|---|---|
| Coef of variation | 10.742 |
| Kurtosis | 252.12 |
| Mean | 0.64259 |
| MAD | 1.119 |
| Skewness | 15.743 |
| Sum | 169 |
| Variance | 47.65 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 229 | 11.5% | |
| 1 | 24 | 1.2% | |
| 3 | 4 | 0.2% | |
| 2 | 4 | 0.2% | |
| 111 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 229 | 11.5% | |
| 1 | 24 | 1.2% | |
| 2 | 4 | 0.2% | |
| 3 | 4 | 0.2% | |
| 14 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 111 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| 3 | 4 | 0.2% | |
| 2 | 4 | 0.2% | |
| 1 | 24 | 1.2% |
qt_coligados_ltda
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| 0 | 251 |
|---|---|
| 1 | 9 |
| 2 | 3 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 251 | 12.6% | |
| 1 | 9 | 0.4% | |
| 2 | 3 | 0.1% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_matriz
Highly correlated
This variable is highly correlated with qt_coligados_ativo and should be ignored for analysis
| Correlation | 0.99409 |
|---|
qt_coligados_me
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| 0 | 262 |
|---|---|
| 1 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 262 | 13.1% | |
| 1 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
qt_coligados_mei
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| 0 | 255 |
|---|---|
| 1 | 7 |
| 2 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 255 | 12.8% | |
| 1 | 7 | 0.4% | |
| 2 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_coligados_nordeste
Numeric
| Distinct count | 17 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2 |
|---|---|
| Minimum | 0 |
| Maximum | 36 |
| Zeros (%) | 4.7% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 7 |
| Maximum | 36 |
| Range | 36 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 4.0095 |
|---|---|
| Coef of variation | 2.0048 |
| Kurtosis | 34.703 |
| Mean | 2 |
| MAD | 2.038 |
| Skewness | 5.2401 |
| Sum | 526 |
| Variance | 16.076 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 93 | 4.7% | |
| 1 | 82 | 4.1% | |
| 2 | 38 | 1.9% | |
| 3 | 13 | 0.7% | |
| 5 | 8 | 0.4% | |
| 6 | 6 | 0.3% | |
| 7 | 6 | 0.3% | |
| 4 | 5 | 0.2% | |
| 8 | 5 | 0.2% | |
| 16 | 1 | 0.1% | |
| Other values (6) | 6 | 0.3% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 93 | 4.7% | |
| 1 | 82 | 4.1% | |
| 2 | 38 | 1.9% | |
| 3 | 13 | 0.7% | |
| 4 | 5 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 36 | 1 | 0.1% | |
| 31 | 1 | 0.1% | |
| 25 | 1 | 0.1% | |
| 20 | 1 | 0.1% | |
| 16 | 1 | 0.1% |
qt_coligados_norte
Highly correlated
This variable is highly correlated with idade_ate_18 and should be ignored for analysis
| Correlation | 0.90019 |
|---|
qt_coligados_nula
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_coligados_sa
Highly correlated
This variable is highly correlated with qt_coligados_matriz and should be ignored for analysis
| Correlation | 0.9177 |
|---|
qt_coligados_serviço
Numeric
| Distinct count | 17 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.7681 |
|---|---|
| Minimum | 0 |
| Maximum | 20 |
| Zeros (%) | 4.8% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 7 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 2.8904 |
|---|---|
| Coef of variation | 1.6348 |
| Kurtosis | 14.514 |
| Mean | 1.7681 |
| MAD | 1.8047 |
| Skewness | 3.3526 |
| Sum | 465 |
| Variance | 8.3544 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 96 | 4.8% | |
| 1 | 88 | 4.4% | |
| 2 | 27 | 1.4% | |
| 3 | 15 | 0.8% | |
| 7 | 9 | 0.4% | |
| 4 | 7 | 0.4% | |
| 5 | 6 | 0.3% | |
| 6 | 4 | 0.2% | |
| 8 | 4 | 0.2% | |
| 17 | 1 | 0.1% | |
| Other values (6) | 6 | 0.3% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 96 | 4.8% | |
| 1 | 88 | 4.4% | |
| 2 | 27 | 1.4% | |
| 3 | 15 | 0.8% | |
| 4 | 7 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 20 | 1 | 0.1% | |
| 19 | 1 | 0.1% | |
| 17 | 1 | 0.1% | |
| 15 | 1 | 0.1% | |
| 11 | 1 | 0.1% |
qt_coligados_sudeste
Highly correlated
This variable is highly correlated with qt_coligados_sa and should be ignored for analysis
| Correlation | 0.91504 |
|---|
qt_coligados_sul
Numeric
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 0.045627 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros (%) | 12.9% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.38805 |
|---|---|
| Coef of variation | 8.5047 |
| Kurtosis | 116.65 |
| Mean | 0.045627 |
| MAD | 0.08952 |
| Skewness | 10.306 |
| Sum | 12 |
| Variance | 0.15058 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 258 | 12.9% | |
| 1 | 2 | 0.1% | |
| 2 | 1 | 0.1% | |
| 3 | 1 | 0.1% | |
| 5 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 258 | 12.9% | |
| 1 | 2 | 0.1% | |
| 2 | 1 | 0.1% | |
| 3 | 1 | 0.1% | |
| 5 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 5 | 1 | 0.1% | |
| 3 | 1 | 0.1% | |
| 2 | 1 | 0.1% | |
| 1 | 2 | 0.1% | |
| 0 | 258 | 12.9% |
qt_coligados_suspensa
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_desligados
Numeric
| Distinct count | 76 |
|---|---|
| Unique (%) | 3.8% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 25.101 |
|---|---|
| Minimum | 0 |
| Maximum | 1985 |
| Zeros (%) | 2.7% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| Median | 4 |
| Q3 | 12 |
| 95-th percentile | 107.1 |
| Maximum | 1985 |
| Range | 1985 |
| Interquartile range | 11 |
Descriptive statistics
| Standard deviation | 112.35 |
|---|---|
| Coef of variation | 4.476 |
| Kurtosis | 207.52 |
| Mean | 25.101 |
| MAD | 34.379 |
| Skewness | 12.958 |
| Sum | 11722 |
| Variance | 12623 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 81 | 4.0% | |
| 2 | 57 | 2.9% | |
| 0 | 54 | 2.7% | |
| 4 | 32 | 1.6% | |
| 3 | 26 | 1.3% | |
| 6 | 24 | 1.2% | |
| 5 | 17 | 0.9% | |
| 7 | 15 | 0.8% | |
| 11 | 14 | 0.7% | |
| 8 | 11 | 0.5% | |
| Other values (65) | 136 | 6.8% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 54 | 2.7% | |
| 1 | 81 | 4.0% | |
| 2 | 57 | 2.9% | |
| 3 | 26 | 1.3% | |
| 4 | 32 | 1.6% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 1985 | 1 | 0.1% | |
| 865 | 1 | 0.1% | |
| 585 | 1 | 0.1% | |
| 451 | 2 | 0.1% | |
| 290 | 1 | 0.1% |
qt_desligados_12meses
Numeric
| Distinct count | 17 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.469 |
|---|---|
| Minimum | 0 |
| Maximum | 233 |
| Zeros (%) | 16.6% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 233 |
| Range | 233 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 11.19 |
|---|---|
| Coef of variation | 7.6175 |
| Kurtosis | 395.85 |
| Mean | 1.469 |
| MAD | 2.2111 |
| Skewness | 19.246 |
| Sum | 686 |
| Variance | 125.21 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 332 | 16.6% | |
| 1 | 61 | 3.0% | |
| 2 | 23 | 1.1% | |
| 3 | 17 | 0.9% | |
| 4 | 11 | 0.5% | |
| 6 | 6 | 0.3% | |
| 5 | 4 | 0.2% | |
| 10 | 3 | 0.1% | |
| 9 | 2 | 0.1% | |
| 25 | 2 | 0.1% | |
| Other values (6) | 6 | 0.3% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 332 | 16.6% | |
| 1 | 61 | 3.0% | |
| 2 | 23 | 1.1% | |
| 3 | 17 | 0.9% | |
| 4 | 11 | 0.5% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 233 | 1 | 0.1% | |
| 40 | 1 | 0.1% | |
| 25 | 2 | 0.1% | |
| 24 | 1 | 0.1% | |
| 14 | 1 | 0.1% |
qt_ex_funcionarios
Highly correlated
This variable is highly correlated with qt_desligados and should be ignored for analysis
| Correlation | 1 |
|---|
qt_filiais
Numeric
| Distinct count | 42 |
|---|---|
| Unique (%) | 2.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 13.996 |
|---|---|
| Minimum | 0 |
| Maximum | 9270 |
| Zeros (%) | 90.9% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 9270 |
| Range | 9270 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 284.47 |
|---|---|
| Coef of variation | 20.325 |
| Kurtosis | 830.78 |
| Mean | 13.996 |
| MAD | 27.188 |
| Skewness | 27.833 |
| Sum | 27992 |
| Variance | 80925 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1818 | 90.9% | |
| 1 | 87 | 4.3% | |
| 2 | 26 | 1.3% | |
| 3 | 12 | 0.6% | |
| 4 | 8 | 0.4% | |
| 5 | 3 | 0.1% | |
| 8 | 3 | 0.1% | |
| 9 | 3 | 0.1% | |
| 59 | 2 | 0.1% | |
| 7 | 2 | 0.1% | |
| Other values (32) | 36 | 1.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 1818 | 90.9% | |
| 1 | 87 | 4.3% | |
| 2 | 26 | 1.3% | |
| 3 | 12 | 0.6% | |
| 4 | 8 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 9270 | 1 | 0.1% | |
| 7687 | 1 | 0.1% | |
| 2537 | 1 | 0.1% | |
| 2186 | 1 | 0.1% | |
| 1715 | 1 | 0.1% |
qt_funcionarios
Highly correlated
This variable is highly correlated with idade_de_44_a_48 and should be ignored for analysis
| Correlation | 0.95213 |
|---|
qt_funcionarios_12meses
Highly correlated
This variable is highly correlated with qt_funcionarios and should be ignored for analysis
| Correlation | 0.987 |
|---|
qt_funcionarios_24meses
Highly correlated
This variable is highly correlated with qt_funcionarios_12meses and should be ignored for analysis
| Correlation | 0.98125 |
|---|
qt_funcionarios_coligados
Highly correlated
This variable is highly correlated with qt_coligados_sudeste and should be ignored for analysis
| Correlation | 0.90732 |
|---|
qt_funcionarios_coligados_gp
Highly correlated
This variable is highly correlated with max_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.97393 |
|---|
qt_funcionarios_grupo
Highly correlated
This variable is highly correlated with qt_filiais and should be ignored for analysis
| Correlation | 0.90038 |
|---|
qt_ramos_coligados
Highly correlated
This variable is highly correlated with qt_coligadas and should be ignored for analysis
| Correlation | 0.94424 |
|---|
qt_regioes_coligados
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| 1 | 232 |
|---|---|
| 2 | 22 |
| 4 | 5 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 232 | 11.6% | |
| 2 | 22 | 1.1% | |
| 4 | 5 | 0.2% | |
| 3 | 4 | 0.2% | |
| (Missing) | 1737 | 86.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_socios
Numeric
| Distinct count | 14 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 25.1% |
| Missing (n) | 502 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.3945 |
|---|---|
| Minimum | 1 |
| Maximum | 37 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 2.15 |
| Maximum | 37 |
| Range | 36 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 1.3499 |
|---|---|
| Coef of variation | 0.96799 |
| Kurtosis | 350.99 |
| Mean | 1.3945 |
| MAD | 0.58837 |
| Skewness | 15.301 |
| Sum | 2089 |
| Variance | 1.8222 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 1117 | 55.9% | |
| 2 | 306 | 15.3% | |
| 3 | 42 | 2.1% | |
| 4 | 12 | 0.6% | |
| 5 | 8 | 0.4% | |
| 6 | 4 | 0.2% | |
| 9 | 2 | 0.1% | |
| 8 | 2 | 0.1% | |
| 19 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| Other values (3) | 3 | 0.1% | |
| (Missing) | 502 | 25.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 1117 | 55.9% | |
| 2 | 306 | 15.3% | |
| 3 | 42 | 2.1% | |
| 4 | 12 | 0.6% | |
| 5 | 8 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 37 | 1 | 0.1% | |
| 19 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 9 | 2 | 0.1% |
qt_socios_coligados
Highly correlated
This variable is highly correlated with qt_funcionarios_coligados and should be ignored for analysis
| Correlation | 0.91132 |
|---|
qt_socios_feminino
Numeric
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 68.8% |
| Missing (n) | 1377 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.1091 |
|---|---|
| Minimum | 1 |
| Maximum | 11 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.53281 |
|---|---|
| Coef of variation | 0.48038 |
| Kurtosis | 196.68 |
| Mean | 1.1091 |
| MAD | 0.20113 |
| Skewness | 11.744 |
| Sum | 691 |
| Variance | 0.28389 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 574 | 28.7% | |
| 2 | 40 | 2.0% | |
| 3 | 7 | 0.4% | |
| 5 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| (Missing) | 1377 | 68.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 574 | 28.7% | |
| 2 | 40 | 2.0% | |
| 3 | 7 | 0.4% | |
| 5 | 1 | 0.1% | |
| 11 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 11 | 1 | 0.1% | |
| 5 | 1 | 0.1% | |
| 3 | 7 | 0.4% | |
| 2 | 40 | 2.0% | |
| 1 | 574 | 28.7% |
qt_socios_masculino
Numeric
| Distinct count | 9 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 57.0% |
| Missing (n) | 1139 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.2218 |
|---|---|
| Minimum | 1 |
| Maximum | 32 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 32 |
| Range | 31 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 1.2088 |
|---|---|
| Coef of variation | 0.98933 |
| Kurtosis | 491.47 |
| Mean | 1.2218 |
| MAD | 0.38493 |
| Skewness | 19.847 |
| Sum | 1052 |
| Variance | 1.4612 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 747 | 37.4% | |
| 2 | 89 | 4.5% | |
| 3 | 12 | 0.6% | |
| 4 | 6 | 0.3% | |
| 5 | 3 | 0.1% | |
| 6 | 2 | 0.1% | |
| 8 | 1 | 0.1% | |
| 32 | 1 | 0.1% | |
| (Missing) | 1139 | 57.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 747 | 37.4% | |
| 2 | 89 | 4.5% | |
| 3 | 12 | 0.6% | |
| 4 | 6 | 0.3% | |
| 5 | 3 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 32 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| 6 | 2 | 0.1% | |
| 5 | 3 | 0.1% | |
| 4 | 6 | 0.3% |
qt_socios_pep
Highly correlated
This variable is highly correlated with qt_socios_masculino and should be ignored for analysis
| Correlation | 0.98678 |
|---|
qt_socios_pf
Highly correlated
This variable is highly correlated with qt_socios_pep and should be ignored for analysis
| Correlation | 0.93094 |
|---|
qt_socios_pj
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 25.1% |
| Missing (n) | 502 |
| 0 | |
|---|---|
| 1 | 12 |
| 2 | 6 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 1479 | 74.0% | |
| 1 | 12 | 0.6% | |
| 2 | 6 | 0.3% | |
| 3 | 1 | 0.1% | |
| (Missing) | 502 | 25.1% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_socios_pj_ativos
Highly correlated
This variable is highly correlated with qt_socios_pj and should be ignored for analysis
| Correlation | 1 |
|---|
qt_socios_pj_baixados
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_socios_pj_inaptos
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_socios_pj_nulos
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_socios_pj_suspensos
Constant
This variable is constant and should be ignored for analysis
| Constant value | 0.0 |
|---|
qt_socios_st_regular
Highly correlated
This variable is highly correlated with qt_socios_pf and should be ignored for analysis
| Correlation | 0.96491 |
|---|
qt_socios_st_suspensa
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 99.3% |
| Missing (n) | 1986 |
| 1 | 13 |
|---|---|
| 2 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 13 | 0.7% | |
| 2 | 1 | 0.1% | |
| (Missing) | 1986 | 99.3% |
qt_ufs_coligados
Numeric
| Distinct count | 9 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 86.9% |
| Missing (n) | 1737 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.3384 |
|---|---|
| Minimum | 1 |
| Maximum | 8 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.97072 |
|---|---|
| Coef of variation | 0.72528 |
| Kurtosis | 17.295 |
| Mean | 1.3384 |
| MAD | 0.561 |
| Skewness | 3.8942 |
| Sum | 352 |
| Variance | 0.9423 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 218 | 10.9% | |
| 2 | 27 | 1.4% | |
| 3 | 6 | 0.3% | |
| 5 | 5 | 0.2% | |
| 4 | 4 | 0.2% | |
| 6 | 1 | 0.1% | |
| 7 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| (Missing) | 1737 | 86.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 218 | 10.9% | |
| 2 | 27 | 1.4% | |
| 3 | 6 | 0.3% | |
| 4 | 4 | 0.2% | |
| 5 | 5 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 8 | 1 | 0.1% | |
| 7 | 1 | 0.1% | |
| 6 | 1 | 0.1% | |
| 5 | 5 | 0.2% | |
| 4 | 4 | 0.2% |
setor
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| COMERCIO | |
|---|---|
| SERVIÇO | |
| INDUSTRIA | 134 |
| Other values (2) | 136 |
| Value | Count | Frequency (%) | |
| COMERCIO | 927 | 46.4% | |
| SERVIÇO | 792 | 39.6% | |
| INDUSTRIA | 134 | 6.7% | |
| CONSTRUÇÃO CIVIL | 122 | 6.1% | |
| AGROPECUARIA | 14 | 0.7% | |
| (Missing) | 11 | 0.5% |
| Max length | 16 |
|---|---|
| Mean length | 8.1595 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
sg_uf
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| MA | |
|---|---|
| RN | |
| AM | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| MA | 534 | 26.7% | |
| RN | 415 | 20.8% | |
| AM | 353 | 17.6% | |
| PI | 328 | 16.4% | |
| RO | 266 | 13.3% | |
| AC | 104 | 5.2% |
| Max length | 2 |
|---|---|
| Mean length | 2 |
| Min length | 2 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
sg_uf_matriz
Categorical
| Distinct count | 18 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| MA | |
|---|---|
| RN | |
| AM | |
| Other values (14) |
| Value | Count | Frequency (%) | |
| MA | 523 | 26.2% | |
| RN | 410 | 20.5% | |
| AM | 347 | 17.3% | |
| PI | 320 | 16.0% | |
| RO | 259 | 13.0% | |
| AC | 99 | 5.0% | |
| DF | 7 | 0.4% | |
| SP | 5 | 0.2% | |
| RJ | 4 | 0.2% | |
| CE | 3 | 0.1% | |
| Other values (7) | 12 | 0.6% | |
| (Missing) | 11 | 0.5% |
| Max length | 3 |
|---|---|
| Mean length | 2.0055 |
| Min length | 2 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
sum_faturamento_estimado_coligadas
Highly correlated
This variable is highly correlated with media_faturamento_est_coligados_gp and should be ignored for analysis
| Correlation | 0.99843 |
|---|
total
Highly correlated
This variable is highly correlated with qt_funcionarios_24meses and should be ignored for analysis
| Correlation | 0.99141 |
|---|
total_filiais_coligados
Highly correlated
This variable is highly correlated with qt_socios_coligados and should be ignored for analysis
| Correlation | 0.96516 |
|---|
tx_crescimento_12meses
Numeric
| Distinct count | 69 |
|---|---|
| Unique (%) | 3.5% |
| Missing (%) | 82.9% |
| Missing (n) | 1658 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | -3.747 |
|---|---|
| Minimum | -100 |
| Maximum | 216.67 |
| Zeros (%) | 10.5% |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -50 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 35.986 |
| Maximum | 216.67 |
| Range | 316.67 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 31.371 |
|---|---|
| Coef of variation | -8.3724 |
| Kurtosis | 10.551 |
| Mean | -3.747 |
| MAD | 16.57 |
| Skewness | 0.50356 |
| Sum | -1281.5 |
| Variance | 984.16 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 210 | 10.5% | |
| -100 | 12 | 0.6% | |
| -33.333 | 10 | 0.5% | |
| -50 | 9 | 0.4% | |
| 50 | 8 | 0.4% | |
| -25 | 7 | 0.4% | |
| -14.286 | 5 | 0.2% | |
| 25 | 5 | 0.2% | |
| 100 | 3 | 0.1% | |
| 33.333 | 3 | 0.1% | |
| Other values (58) | 70 | 3.5% | |
| (Missing) | 1658 | 82.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -100 | 12 | 0.6% | |
| -93.827 | 1 | 0.1% | |
| -83.333 | 1 | 0.1% | |
| -62.5 | 1 | 0.1% | |
| -60 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 216.67 | 1 | 0.1% | |
| 128.57 | 1 | 0.1% | |
| 100 | 3 | 0.1% | |
| 84.211 | 1 | 0.1% | |
| 57.143 | 1 | 0.1% |
tx_crescimento_24meses
Numeric
| Distinct count | 95 |
|---|---|
| Unique (%) | 4.8% |
| Missing (%) | 82.3% |
| Missing (n) | 1646 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | -13.952 |
|---|---|
| Minimum | -100 |
| Maximum | 600 |
| Zeros (%) | 6.6% |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -100 |
| Q1 | -42.262 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 68.689 |
| Maximum | 600 |
| Range | 700 |
| Interquartile range | 42.262 |
Descriptive statistics
| Standard deviation | 64.885 |
|---|---|
| Coef of variation | -4.6505 |
| Kurtosis | 28.544 |
| Mean | -13.952 |
| MAD | 37.592 |
| Skewness | 3.7693 |
| Sum | -4939.2 |
| Variance | 4210.1 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 131 | 6.6% | |
| -100 | 39 | 1.9% | |
| -50 | 17 | 0.9% | |
| -25 | 12 | 0.6% | |
| -33.333 | 11 | 0.5% | |
| 100 | 8 | 0.4% | |
| -20 | 8 | 0.4% | |
| -66.667 | 7 | 0.4% | |
| -40 | 6 | 0.3% | |
| 200 | 4 | 0.2% | |
| Other values (84) | 111 | 5.5% | |
| (Missing) | 1646 | 82.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -100 | 39 | 1.9% | |
| -94.737 | 1 | 0.1% | |
| -90.909 | 1 | 0.1% | |
| -87.5 | 1 | 0.1% | |
| -85.714 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 600 | 1 | 0.1% | |
| 400 | 1 | 0.1% | |
| 300 | 1 | 0.1% | |
| 200 | 4 | 0.2% | |
| 125 | 1 | 0.1% |
tx_rotatividade
Numeric
| Distinct count | 69 |
|---|---|
| Unique (%) | 3.5% |
| Missing (%) | 76.6% |
| Missing (n) | 1533 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 8.1273 |
|---|---|
| Minimum | 0 |
| Maximum | 200 |
| Zeros (%) | 18.1% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 40 |
| Maximum | 200 |
| Range | 200 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 22.028 |
|---|---|
| Coef of variation | 2.7103 |
| Kurtosis | 29.054 |
| Mean | 8.1273 |
| MAD | 12.694 |
| Skewness | 4.6626 |
| Sum | 3795.4 |
| Variance | 485.22 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 362 | 18.1% | |
| 33.333 | 9 | 0.4% | |
| 28.571 | 8 | 0.4% | |
| 40 | 7 | 0.4% | |
| 25 | 4 | 0.2% | |
| 15.385 | 2 | 0.1% | |
| 16.667 | 2 | 0.1% | |
| 23.529 | 2 | 0.1% | |
| 11.765 | 2 | 0.1% | |
| 90.909 | 2 | 0.1% | |
| Other values (58) | 67 | 3.4% | |
| (Missing) | 1533 | 76.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 362 | 18.1% | |
| 2.9412 | 1 | 0.1% | |
| 4 | 1 | 0.1% | |
| 4.6512 | 1 | 0.1% | |
| 5 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 200 | 1 | 0.1% | |
| 190.98 | 1 | 0.1% | |
| 150 | 1 | 0.1% | |
| 133.33 | 1 | 0.1% | |
| 90.909 | 2 | 0.1% |
vl_faturamento_estimado_aux
Highly correlated
This variable is highly correlated with total and should be ignored for analysis
| Correlation | 0.98498 |
|---|
vl_faturamento_estimado_grupo_aux
Highly correlated
This variable is highly correlated with qt_socios_st_suspensa and should be ignored for analysis
| Correlation | 0.94871 |
|---|
vl_folha_coligados
Highly correlated
This variable is highly correlated with qt_socios_pep and should be ignored for analysis
| Correlation | 0.94709 |
|---|
vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with vl_folha_coligados and should be ignored for analysis
| Correlation | 0.92398 |
|---|
vl_frota
Numeric
| Distinct count | 112 |
|---|---|
| Unique (%) | 5.6% |
| Missing (%) | 94.2% |
| Missing (n) | 1885 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.0161e+05 |
|---|---|
| Minimum | 1680 |
| Maximum | 7.8304e+05 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1680 |
|---|---|
| 5-th percentile | 4225.6 |
| Q1 | 29530 |
| Median | 57476 |
| Q3 | 1.0689e+05 |
| 95-th percentile | 3.5427e+05 |
| Maximum | 7.8304e+05 |
| Range | 7.8136e+05 |
| Interquartile range | 77360 |
Descriptive statistics
| Standard deviation | 1.3409e+05 |
|---|---|
| Coef of variation | 1.3196 |
| Kurtosis | 9.3441 |
| Mean | 1.0161e+05 |
| MAD | 86046 |
| Skewness | 2.8708 |
| Sum | 1.1685e+07 |
| Variance | 1.7979e+10 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 72838 | 2 | 0.1% | |
| 98819 | 2 | 0.1% | |
| 39289 | 2 | 0.1% | |
| 41298 | 2 | 0.1% | |
| 76140 | 1 | 0.1% | |
| 8069 | 1 | 0.1% | |
| 1.6912e+05 | 1 | 0.1% | |
| 2.4183e+05 | 1 | 0.1% | |
| 71556 | 1 | 0.1% | |
| 7.8304e+05 | 1 | 0.1% | |
| Other values (101) | 101 | 5.1% | |
| (Missing) | 1885 | 94.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1680 | 1 | 0.1% | |
| 2429 | 1 | 0.1% | |
| 3306 | 1 | 0.1% | |
| 3375 | 1 | 0.1% | |
| 3392 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 7.8304e+05 | 1 | 0.1% | |
| 6.5655e+05 | 1 | 0.1% | |
| 5.6122e+05 | 1 | 0.1% | |
| 5.377e+05 | 1 | 0.1% | |
| 4.8123e+05 | 1 | 0.1% |
vl_idade_maxima_socios_pj
Numeric
| Distinct count | 19 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 99.1% |
| Missing (n) | 1981 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 10.944 |
|---|---|
| Minimum | 1.3142 |
| Maximum | 30.064 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.3142 |
|---|---|
| 5-th percentile | 3.9877 |
| Q1 | 6.9774 |
| Median | 9.8782 |
| Q3 | 12.557 |
| 95-th percentile | 24.01 |
| Maximum | 30.064 |
| Range | 28.75 |
| Interquartile range | 5.5797 |
Descriptive statistics
| Standard deviation | 6.8002 |
|---|---|
| Coef of variation | 0.62138 |
| Kurtosis | 2.6795 |
| Mean | 10.944 |
| MAD | 4.7569 |
| Skewness | 1.4634 |
| Sum | 207.93 |
| Variance | 46.242 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 9.8782 | 2 | 0.1% | |
| 23.337 | 1 | 0.1% | |
| 6.1191 | 1 | 0.1% | |
| 15.343 | 1 | 0.1% | |
| 4.5941 | 1 | 0.1% | |
| 30.064 | 1 | 0.1% | |
| 13.374 | 1 | 0.1% | |
| 5.8371 | 1 | 0.1% | |
| 8.8843 | 1 | 0.1% | |
| 8.2656 | 1 | 0.1% | |
| Other values (8) | 8 | 0.4% | |
| (Missing) | 1981 | 99.1% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.3142 | 1 | 0.1% | |
| 4.2847 | 1 | 0.1% | |
| 4.5941 | 1 | 0.1% | |
| 5.8371 | 1 | 0.1% | |
| 6.1191 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 30.064 | 1 | 0.1% | |
| 23.337 | 1 | 0.1% | |
| 16.575 | 1 | 0.1% | |
| 15.343 | 1 | 0.1% | |
| 13.374 | 1 | 0.1% |
vl_idade_media_socios_pj
Highly correlated
This variable is highly correlated with vl_idade_maxima_socios_pj and should be ignored for analysis
| Correlation | 0.98045 |
|---|
vl_idade_minima_socios_pj
Highly correlated
This variable is highly correlated with vl_idade_media_socios_pj and should be ignored for analysis
| Correlation | 0.98102 |
|---|
vl_potenc_cons_oleo_gas
Highly correlated
This variable is highly correlated with vl_folha_coligados_gp and should be ignored for analysis
| Correlation | 0.99995 |
|---|
vl_total_tancagem
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
vl_total_tancagem_grupo
Highly correlated
This variable is highly correlated with vl_folha_coligados_gp and should be ignored for analysis
| Correlation | 1 |
|---|
vl_total_veiculos_antt
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
vl_total_veiculos_antt_grupo
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
vl_total_veiculos_leves
Numeric
| Distinct count | 13 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 93.2% |
| Missing (n) | 1864 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.6103 |
|---|---|
| Minimum | 0 |
| Maximum | 24 |
| Zeros (%) | 1.6% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 24 |
| Range | 24 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 2.6025 |
|---|---|
| Coef of variation | 1.6162 |
| Kurtosis | 41.789 |
| Mean | 1.6103 |
| MAD | 1.3501 |
| Skewness | 5.5447 |
| Sum | 219 |
| Variance | 6.7729 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 66 | 3.3% | |
| 0 | 32 | 1.6% | |
| 2 | 18 | 0.9% | |
| 3 | 7 | 0.4% | |
| 4 | 4 | 0.2% | |
| 5 | 3 | 0.1% | |
| 11 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| 24 | 1 | 0.1% | |
| 9 | 1 | 0.1% | |
| Other values (2) | 2 | 0.1% | |
| (Missing) | 1864 | 93.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 32 | 1.6% | |
| 1 | 66 | 3.3% | |
| 2 | 18 | 0.9% | |
| 3 | 7 | 0.4% | |
| 4 | 4 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 24 | 1 | 0.1% | |
| 11 | 1 | 0.1% | |
| 9 | 1 | 0.1% | |
| 8 | 1 | 0.1% | |
| 7 | 1 | 0.1% |
vl_total_veiculos_leves_grupo
Numeric
| Distinct count | 35 |
|---|---|
| Unique (%) | 1.8% |
| Missing (%) | 0.5% |
| Missing (n) | 11 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 21.327 |
|---|---|
| Minimum | 0 |
| Maximum | 35064 |
| Zeros (%) | 91.7% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 35064 |
| Range | 35064 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 788.34 |
|---|---|
| Coef of variation | 36.965 |
| Kurtosis | 1966.9 |
| Mean | 21.327 |
| MAD | 41.802 |
| Skewness | 44.237 |
| Sum | 42419 |
| Variance | 6.2147e+05 |
| Memory size | 111.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1834 | 91.7% | |
| 1 | 71 | 3.5% | |
| 2 | 26 | 1.3% | |
| 3 | 9 | 0.4% | |
| 4 | 7 | 0.4% | |
| 8 | 5 | 0.2% | |
| 5 | 4 | 0.2% | |
| 88 | 2 | 0.1% | |
| 6 | 2 | 0.1% | |
| 18 | 2 | 0.1% | |
| Other values (24) | 27 | 1.4% | |
| (Missing) | 11 | 0.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 1834 | 91.7% | |
| 1 | 71 | 3.5% | |
| 2 | 26 | 1.3% | |
| 3 | 9 | 0.4% | |
| 4 | 7 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 35064 | 1 | 0.1% | |
| 2134 | 1 | 0.1% | |
| 888 | 1 | 0.1% | |
| 782 | 1 | 0.1% | |
| 479 | 1 | 0.1% |
vl_total_veiculos_pesados
Highly correlated
This variable is highly correlated with vl_potenc_cons_oleo_gas and should be ignored for analysis
| Correlation | 0.91186 |
|---|
vl_total_veiculos_pesados_grupo
Highly correlated
This variable is highly correlated with vl_potenc_cons_oleo_gas and should be ignored for analysis
| Correlation | 0.90934 |
|---|